Journal of Natural Language Processing (自然言語処理)
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Solving Ambiguities in Indonesian Words by Morphological Analysis Using Minimum Connectivity Cost
Mohammad Teduh Uliniansyah, Shun Ishizaki, Kiyoko Uchiyama
Journal, free access

2004, Volume 11, Issue 1, pp. 3-20

Abstract

The Indonesian language (Bahasa Indonesia) has several uncommon characteristics, notably a large number of derivational affixes. Affixes and stems combine in so many ways that ambiguities often arise. Recording every word in a word dictionary is nearly impossible, because the dictionary would become huge and processing time very long. To address these problems, we propose a method that analyzes the morphology of Indonesian words using part-of-speech (POS) tagged data, an affix rule table, and minimum connectivity costs. In experiments, our system achieved more than 97% accuracy.
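
To make the idea of choosing an analysis by minimum connectivity cost concrete, here is a minimal Python sketch. It is not the authors' system: the toy lexicon (`STEMS`), the affix rule table (`PREFIXES`, `SUFFIXES`), the cost values in `COST`, and the function name `analyze` are all hypothetical and chosen only for illustration, and the sketch ignores the sound changes that real Indonesian affixation involves. It enumerates every prefix + stem + suffix split the tables allow, sums an assumed cost for each pair of adjacent units, and ranks the candidates from cheapest to most expensive.

```python
# Hypothetical toy lexicon: stem -> POS tag (N = noun, V = verb).
STEMS = {"uang": "N", "beruang": "N", "ajar": "V", "baca": "V"}

# Hypothetical affix rule table: surface form -> affix label.
PREFIXES = {"ber": "ber-", "mem": "meN-", "meng": "meN-"}
SUFFIXES = {"an": "-an", "kan": "-kan"}

# Hypothetical connectivity costs between adjacent units.
# BOS/EOS mark the beginning and end of the word.
COST = {
    ("BOS", "N"): 1, ("BOS", "V"): 1,
    ("BOS", "ber-"): 1, ("BOS", "meN-"): 1,
    ("ber-", "N"): 2, ("ber-", "V"): 1,
    ("meN-", "N"): 3, ("meN-", "V"): 1,
    ("N", "-an"): 2, ("V", "-an"): 1, ("V", "-kan"): 1,
    ("N", "EOS"): 1, ("V", "EOS"): 1,
    ("-an", "EOS"): 1, ("-kan", "EOS"): 1,
}
DEFAULT_COST = 10  # assumed penalty for any pair missing from the table


def analyze(word):
    """Return all candidate analyses as (cost, (prefix, stem, suffix)), cheapest first."""
    candidates = []
    for p_surf, p_tag in [("", None)] + list(PREFIXES.items()):
        if not word.startswith(p_surf):
            continue
        for s_surf, s_tag in [("", None)] + list(SUFFIXES.items()):
            if not word.endswith(s_surf):
                continue
            stem = word[len(p_surf):len(word) - len(s_surf)]
            if stem not in STEMS:
                continue
            # Sequence of unit labels, skipping absent affixes.
            tags = [t for t in ("BOS", p_tag, STEMS[stem], s_tag, "EOS") if t is not None]
            cost = sum(COST.get(pair, DEFAULT_COST) for pair in zip(tags, tags[1:]))
            candidates.append((cost, (p_surf, stem, s_surf)))
    return sorted(candidates)


if __name__ == "__main__":
    for w in ("beruang", "membaca", "bacaan"):
        print(w, analyze(w))
```

With these assumed tables, `beruang` has two candidates: the whole-word noun `beruang` ("bear") and the split `ber-` + `uang` ("to have money"). Which reading wins depends entirely on the cost values, which is why the costs would in practice be derived from data such as the POS-tagged corpus mentioned above rather than set by hand as they are here.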

© The Association for Natural Language Processing