Wikipediaからの連想シソーラス構築プロジェクト

伊藤 雅弘; 中山 浩太郎; 原 隆弘; 西尾 章治郎

doi:10.11517/jsaisigtwo.2009.SWO-020_05

Abstract

Wikipedia, a huge scale Web based encyclopedia, attracts great attention as an invaluable corpus for knowledge extraction because it has various impressive characteristics such as a huge number of articles, live updates, a dense link structure, brief anchor texts and URL identification for concepts. We have already proved that we can use Wikipedia to construct a huge scale accurate association thesaurus. The association thesaurus we constructed covers almost 1.3 million concepts and its accuracy is proved in detailed experiments. In this paper, we introduce our project for constructing a high quality association thesaurus from Wikipedia

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!