Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Article
Comparison of Chinese Treebanks for Corpus-oriented HPSG Grammar Development
Kun YuYusuke MiyaoTakuya MatsuzakiXiangli WangYaozhong ZhangKiyotaka UchimotoJunichi Tsujii
Author information
JOURNAL FREE ACCESS

2010 Volume 17 Issue 3 Pages 3_61-3_80

Details
Abstract
Comparing with the traditional way of manually developing grammar based on linguistic theory, corpus-oriented grammar development is more promising. To develop HPSG grammar through the corpus-oriented way, a treebank is an indispensable part. This paper first compares existing Chinese treebanks and chooses one of them as the basic resource for HPSG grammar development. Then it proposes a new design of part-of-speech tags based on the assumption that it is not only simple enough to reduce ambiguity of morphological analysis as much as possible, but also rich enough for HPSG grammar development. Finally, it introduces some on-going work about utilizing a Chinese scientific paper treebank in HPSG grammar development.
Content from these authors
© 2010 The Association for Natural Language Processing
Previous article Next article
feedback
Top