Journal of Natural Language Processing (自然言語処理)
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Reinventing Part-Of-Speech Tagging
Ezra Black, Stephen Eubank, Hideki Kashioka, David Magerman, Jared Saia, Akira Ushioda

1998, Volume 5, Issue 1, pp. 3-23

Abstract
Part-of-speech tagging methodology has succeeded, but on problems that may lack real-world application. Redirection of the field is indicated, toward potentially more useful, but harder and more sophisticated tagging tasks: (1) using much more detailed tagsets (semantically and syntactically); (2) testing performance on treebanks reflecting the huge gamut of domains, etc., characterizing real-world applications; (3) understanding the magnitude of the unknown-word and unknown-tag problems, then overcoming them. Tagging results are presented on two versions of a new, highly variegated treebank, featuring tagsets of 2720 and 443 tags, respectively, and utilizing a dictionaryless, decision-tree tagger.
© The Association for Natural Language Processing