Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
A Method for Detecting Japanese Homophone Errors in Compound Nouns based on Character Cooccurrence and Its Evaluation
MASAHIRO OKUKOJI MATSUOKA
Author information
JOURNAL FREE ACCESS

1997 Volume 4 Issue 3 Pages 83-99

Details
Abstract
Most Japanese texts are produced with Japanese word processors.As Japanese textsconsist of phonograms, KANA, and ideograms, KANJI, Japanese word processorsalways use KANA-KANJI conversion in which KANA sequences input through thekeyboard are converted into KANA-KANJI sequences.Therefore, Japanese textssuffer from homophone errors caused by erroneous KANA-KANJI conversion.Ahomophone error occurs when a KANA sequence is converted into the wrong wordwhich has the same reading.Detecting homophone errors is an important topic in Japanese text revision support systems.We have already proposed a high performancemethod for handling Japanese homophone errors in compound nouns usedin REVISE.The method, however, has some drawbacks.To compensate for thesedrawbacks, this paper describes a method for detecting Japanese homophone errorsin compound nouns that uses character cooccurrence.Character cooccurrence canbe easily collected from existing texts without any analysis.Therefore, this methodcan be used, in a Japanese revision support system, as a complementary method forhandling Japanese homophone errors in compound nouns.Moreover, as this methoddepends only on character cooccurrence, it can be applied not only to homophoneerrors but also other types of errors such as character deletion.
Content from these authors
© The Association for Natural Language Processing
Previous article
feedback
Top