サポートベクトルマシンを用いた中国語解析実験

吉田 辰巳; 大竹 清敬; 山本 和英

doi:10.5715/jnlp.10.109

Journal of Natural Language Processing

Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619

Performance Evaluation of Chinese Analyzers with Support Vector Machines

TATSUMI YOSHIDA, KIYONORI OHTAKE, KAZUHIDE YAMAMOTO

Author information

Keywords: Chinese analysis, Support Vector Machine, Chunking, YamCha, MOZ

JOURNAL FREE ACCESS

2003 Volume 10 Issue 1 Pages 109-131

DOI https://doi.org/10.5715/jnlp.10.109

Details

Abstract

We will report performances of currently and publicly available Chinese analyzers and resources. We use YamCha, a tool based on Support Vector Machines, and the Penn Chinese Treebank as a language resource. Combining these two, we measure the performances of Chinese analysis, i. e., word segmentation, part-of-speech tagging, and base phrase chunking. In the experiment of word segmentation and part-of-speech tagging, we also report the performance of MOZ, a statistical morphological analyzer, which is also available to the public. We found that the accuracy of morphological analysis using YamCha attains around 88%, which is over 4% higher than that of MOZ, although it is computationally very expensive. We also found that the accuracy for base phrase chunking is approximately 93%.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!