Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
An IR Similarity Measure which is Tolerant for Morphological Variation
EIKO YAMAMOTOYOSHIYUKI TAKEDAKYOJI UMEMURA
Author information
JOURNAL FREE ACCESS

2003 Volume 10 Issue 1 Pages 63-80

Details
Abstract
In this paper, we propose a measure for information retrieval (IR). This measure is tolerant for morphological variation. When various persons describe the data to retrieve, their notations may vary even if the data describe the same topic. This variation prevents system to retrieve all of relevant documents for the input sentence. Although human can handle this variation, computers usually can not handle this. Edit distance is a well-known measure that can cope with this variation. We have used this measure for information retrieval and found that its precision is poor. Therefore, we propose to modify this similarity measure to be suitable for information retrieval. We show that this extension improves the performance. We also compared the proposed similarity measure with the popular similarity measures used in many information retrieval systems.
Content from these authors
© The Association for Natural Language Processing
Previous article Next article
feedback
Top