Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Language System and Morphological Processing Technique for Korean Computational Processing
KAZUHIDE YAMAMOTO
JOURNAL FREE ACCESS

2000 Volume 7 Issue 4 Pages 25-62

Abstract
This paper proposes a morpheme and part-of-speech system for Korean natural language processing, and for machine translation in particular. We designed the system to make computer processing easier, since it is important to segment and tag input Korean strings with satisfactory performance. For the purposes of machine translation, a linguistic part-of-speech system is also under-classified in some respects and over-classified in others. We therefore define an original part-of-speech system and illustrate it with examples. Our morphological analysis is based on mixed n-gram statistics over both parts of speech and words, and we tuned the engine to the characteristics of the Korean language. Experiments show that the engine achieves 99.1% word recall, 98.9% word precision, and 92.6% sentence accuracy on unseen Korean strings. For language generation, we propose spacing rules for Korean based on our part-of-speech system. The appropriateness of our morpheme system is demonstrated by machine translation performance in both the Japanese-Korean and Korean-Japanese directions, as reported in Furuse, Yamamoto, and Yamada (1999).
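As an illustration of the three figures reported above, the following minimal sketch shows one common way to compute word recall, word precision, and sentence accuracy for a segmenter and tagger. It is not taken from the paper: the abstract does not state the exact scoring procedure, so the span-based matching, the function names `to_spans` and `evaluate`, the tag labels, and the toy data are all assumptions made for this example.

```python
def to_spans(morphemes):
    """Turn [(surface, tag), ...] into a set of (start, end, tag) spans.

    Assumption: surfaces concatenate to the same string in the reference
    and in the system output, so character offsets are comparable.
    """
    spans = set()
    start = 0
    for surface, tag in morphemes:
        end = start + len(surface)
        spans.add((start, end, tag))
        start = end
    return spans


def evaluate(gold_sents, pred_sents):
    """Score system output against reference analyses (hypothetical scheme)."""
    correct = gold_total = pred_total = exact = 0
    for gold, pred in zip(gold_sents, pred_sents):
        g, p = to_spans(gold), to_spans(pred)
        correct += len(g & p)      # same boundaries and same tag
        gold_total += len(g)
        pred_total += len(p)
        exact += int(g == p)       # whole sentence analysed correctly
    return {
        "recall": correct / gold_total,
        "precision": correct / pred_total,
        "sentence_accuracy": exact / len(gold_sents),
    }


# Toy data with made-up tags "N" (noun) and "P" (particle).
gold = [
    [("학교", "N"), ("에", "P")],
    [("나", "N"), ("는", "P")],
]
pred = [
    [("학교", "N"), ("에", "P")],
    [("나는", "N")],               # under-segmented: no span matches
]
scores = evaluate(gold, pred)
```

On this toy data, 2 of the 4 reference morphemes are recovered and 2 of the 3 predicted morphemes are correct, and one of the two sentences matches exactly. A scheme of this kind penalizes a segmentation error in both recall and precision, which is why sentence accuracy is typically much lower than the word-level figures.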
© The Association for Natural Language Processing