Information and Media Technologies
Online ISSN : 1881-0896
ISSN-L : 1881-0896
Information Systems and Applications
Comparison of Chinese Treebanks for Corpus-oriented HPSG Grammar Development
Kun YuYusuke MiyaoTakuya MatsuzakiXiangli WangYaozhong ZhangKiyotaka UchimotoJunichi Tsujii
Author information
JOURNAL FREE ACCESS

2010 Volume 5 Issue 2 Pages 910-929

Details
Abstract
Comparing with the traditional way of manually developing grammar based on lin- guistic theory, corpus-oriented grammar development is more promising. To develop HPSG grammar through the corpus-oriented way, a treebank is an indispensable part. This paper first compares existing Chinese treebanks and chooses one of them as the basic resource for HPSG grammar development. Then it proposes a new design of part-of-speech tags based on the assumption that it is not only simple enough to re-duce ambiguity of morphological analysis as much as possible, but also rich enough for HPSG grammar development. Finally, it introduces some on-going work about utilizing a Chinese scientific paper treebank in HPSG grammar development.
Content from these authors
© 2010 by The Association for Natural Language Processing
Previous article Next article
feedback
Top