IEICE Transactions on Information and Systems
Online ISSN : 1745-1361
Print ISSN : 0916-8532
Regular Section
Minimizing Human Intervention for Constructing Korean Part-of-Speech Tagged Corpus
Do-Gil LEEGumwon HONGSeok Kee LEEHae-Chang RIM
著者情報
ジャーナル フリー

2010 年 E93.D 巻 8 号 p. 2336-2338

詳細
抄録
The construction of annotated corpora requires considerable manual effort. This paper presents a pragmatic method to minimize human intervention for the construction of Korean part-of-speech (POS) tagged corpus. Instead of focusing on improving the performance of conventional automatic POS taggers, we devise a discriminative POS tagger which can selectively produce either a single analysis or multiple analyses based on the tagging reliability. The proposed approach uses two decision rules to judge the tagging reliability. Experimental results show that the proposed approach can effectively control the quality of corpus and the amount of manual annotation by the threshold value of the rule.
著者関連情報
© 2010 The Institute of Electronics, Information and Communication Engineers
前の記事
feedback
Top