複数の観点で分類した自然言語処理用シソーラス

国分 芳宏; 岡野 弘行

doi:10.5715/jnlp.17.1_247

Abstract

Instead of a thesaurus specialized in conventional information retrieval, we developed a thesaurus of 420,000 terms for the purpose of natural language processing such as parsing or the term standardization. Because each entry term has a large number of terms with various semantic relations, we introduce a facet and classify them for finding relative terms easily. Furthermore, we distinguish discriminatory terms, and fluctuating Japanese spellings. We described points to keep in mind and future tasks in making a thesaurus. Our package has the connecting function with the Internet and the other dictionaries.

Content from these authors

Licensed under CC BY 4.0
https://creativecommons.org/licenses/by/4.0/

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Register with J-STAGE for free!