Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Automatically Extracting Collocations Based on Words Position Information in Corpora
HIROKI ODA, KENJI KITA

1998 Volume 5 Issue 1 Pages 79-99

Abstract
Collocations, which are cohesive and recurrent word clusters, play an important role in many natural language processing applications. In this paper, we present a set of new techniques for automatically identifying or extracting collocations from corpora. These techniques are based on word position information and produce a wide range of collocations, both continuous and discontinuous. Their effectiveness has been confirmed by evaluation experiments using the ADD (ATR Dialogue Database) corpus.
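
The abstract does not spell out the algorithm, so the following is only a minimal sketch of what extraction from word position information could look like, not the authors' method. It assumes a pre-tokenized corpus and uses hypothetical names (`build_position_index`, `pair_candidates`, `max_gap`, `min_count`) and a plain frequency threshold chosen purely for illustration. An offset of 1 corresponds to a continuous pair; a larger offset corresponds to a discontinuous one.

```python
from collections import defaultdict


def build_position_index(tokens):
    """Map each word to the list of corpus positions where it occurs."""
    index = defaultdict(list)
    for pos, word in enumerate(tokens):
        index[word].append(pos)
    return index


def pair_candidates(tokens, max_gap=3, min_count=2):
    """Count ordered pairs (w1, w2, offset) where w2 occurs exactly
    `offset` positions after w1, for offsets 1..max_gap.

    Illustrative assumption: a pair is a collocation candidate when it
    recurs at the same offset at least `min_count` times.
    """
    index = build_position_index(tokens)
    counts = defaultdict(int)
    for w1, positions in index.items():
        for p in positions:
            for offset in range(1, max_gap + 1):
                q = p + offset
                if q < len(tokens):
                    counts[(w1, tokens[q], offset)] += 1
    # Keep only pairs that recur often enough at a fixed offset.
    return {key: c for key, c in counts.items() if c >= min_count}


tokens = "turn the radio off then turn the light off".split()
candidates = pair_candidates(tokens)
```

On this toy input, the candidates include the continuous pair ("turn", "the") at offset 1 and the discontinuous pair ("turn", "off") at offset 3. A real system would likely replace the raw count threshold with a statistical association measure and extend the idea beyond pairs.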
© The Association for Natural Language Processing