JSAI Technical Report, Type 2 SIG
Online ISSN : 2436-5556
Our Association Thesaurus Construction Project from Wikipedia
Masahiro ITOKotaro NAKAYAMATakahiro HARAShojiro NISHIO
Author information
RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

2009 Volume 2009 Issue SWO-020 Pages 05-

Details
Abstract

Wikipedia, a huge scale Web based encyclopedia, attracts great attention as an invaluable corpus for knowledge extraction because it has various impressive characteristics such as a huge number of articles, live updates, a dense link structure, brief anchor texts and URL identification for concepts. We have already proved that we can use Wikipedia to construct a huge scale accurate association thesaurus. The association thesaurus we constructed covers almost 1.3 million concepts and its accuracy is proved in detailed experiments. In this paper, we introduce our project for constructing a high quality association thesaurus from Wikipedia

Content from these authors
© 2009 Authors
Previous article Next article
feedback
Top