SCIS & ISIS
SCIS & ISIS 2006
Session ID : TH-F2-1

TH-F2 Language-Mediated Information Processing
Extracting important information from natural language processing article abstracts and visualizing them
*Masaki Murata, Makoto Kikui, Qing Ma, Toshiyuki Kanamaru, Hitoshi Isahara
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

We studied ways of extracting important information from the abstracts of natural language processing articles. Although information extraction has been studied for many kinds of text, such as newspapers, web materials, and biology article abstracts, no papers have been written about extracting information from natural language processing articles. We hope that this study will be useful for natural language processing researchers. We defined nine categories of important expressions found in natural language processing article abstracts and constructed a method of extracting these expressions using machine learning. Our method extracted these expressions with an F-measure of 0.67. When partially correct expressions were counted as correct, the F-measure increased to 0.73. In particular, important expressions belonging to four categories (Accuracy, Field, Language, and Name) were extracted automatically with a high F-measure (about 0.8 to 0.9). We next constructed several visualization tools that use the extracted important expressions. They can display the abstract of a paper with the extracted important expressions highlighted in color, display abstracts in a table with the extracted important expressions in each row, and draw a graph showing the distribution and trend of papers that include important expressions for each category.
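
The difference between the strict and the partially correct F-measure can be illustrated with a minimal sketch. The abstract does not say how partial correctness was judged, so this sketch assumes that a predicted expression is partially correct when it has the same category as a gold expression and overlaps it by at least one character. The function name `f_measure`, the span representation, and the toy spans are hypothetical and are not taken from the paper.

```python
def f_measure(gold, predicted, partial=False):
    """Balanced F-measure over sets of (category, start, end) spans.

    Assumption: with partial=True, a span counts as correct when its
    category matches and the character ranges overlap; otherwise the
    boundaries must match exactly.
    """
    def matches(p, g):
        if p[0] != g[0]:
            return False
        if partial:
            # Half-open ranges [start, end) overlap
            return p[1] < g[2] and g[1] < p[2]
        return p[1:] == g[1:]

    # Predicted spans that match some gold span (for precision)
    correct_pred = sum(any(matches(p, g) for g in gold) for p in predicted)
    # Gold spans that are matched by some predicted span (for recall)
    correct_gold = sum(any(matches(p, g) for p in predicted) for g in gold)

    precision = correct_pred / len(predicted) if predicted else 0.0
    recall = correct_gold / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy spans for illustration only (not data from the paper)
gold = {("Accuracy", 10, 14), ("Language", 30, 38), ("Name", 50, 55)}
pred = {("Accuracy", 10, 14), ("Language", 32, 38), ("Field", 70, 75)}

strict_f = f_measure(gold, pred)                 # exact boundaries required
lenient_f = f_measure(gold, pred, partial=True)  # overlapping spans accepted
print(f"strict F = {strict_f:.2f}, lenient F = {lenient_f:.2f}")
```

In this toy case the `Language` span has the right category but a shifted start position, so it counts only under the lenient setting. This is the kind of case that would raise a partially correct score above the strict one.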

© 2006 Japan Society for Fuzzy Theory and Intelligent Informatics