Abstract
This paper addresses the problem of tokenization and part-of-speech tagging for both segmented and non-segmented languages, and proposes a simple framework that enables efficient and uniform treatment of tokenization for both types of languages. We also report a language-independent morphological analysis system based on the proposed framework, and present running systems for three different languages: English, Japanese, and Chinese.