単語の位置情報に基づくコーパスからのコロケーションの自動抽出

小田 裕樹; 北 研二

doi:10.5715/jnlp.5.79

Abstract

Collocations, which are cohesive and recurrent word clusters, play an important role in many natural language application systems. In this paper, we present a set of new techniques for automatically identifying or extracting collocations from corpora. These techniques are based on words position information, and produce a wide range of collocations, including continuous or discontinuous collocations. The effectiveness has been confirmed by evaluation experiments using the ADD (ATR Dialogue Database) corpus.

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Register with J-STAGE for free!