Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Resolving Overlapping Ambiguities and Selecting Correct Word Sequence in Chinese Using Internet Corpus
Dongli HanHaodong WuTeiji Furugori
Author information
JOURNAL FREE ACCESS

2001 Volume 8 Issue 3 Pages 107-121

Details
Abstract
We propose an effective method for resolving overlapping ambiguities found in sentential analyses of Chinese. It detects the ambiguities by a FBMM scanner, resolves them by using the relevancy value (RV), a statistical measure for word co-occurrences taken from textual data on the Internet, and selects the correct word sequence for the sentence being analyzed. We use contextual information also when RVs are considered not sufficient to resolving the ambiguities and choosing the correct word sequence. An experiment for selecting the desired sequences shows a success rate of about 85%. This result is convincing and far better than those in other comparable studies.
Content from these authors
© The Association for Natural Language Processing
Previous article Next article
feedback
Top