Extracting important information from natural language processing article abstracts and visualizing them

Masaki Murata; Makoto Kikui; Qing Ma; Toshiyuki Kanamaru; Hitoshi Isahara

doi:10.14864/softscis.2006.0.112.0

SCIS & ISIS 2006

Session ID : TH-F2-1

DOI https://doi.org/10.14864/softscis.2006.0.112.0

Conference information

Host: Japan SOciety for Fuzzy Theory and intelligent informatics

Co-host: The Korea Fuzzy Logic and Intelligent Systems Society, IEEE Computational Intelligence Society, The International Fuzzy Systems Association, 21th Century COE Program "Creation of Agent-Based Social Systems Sciences"

TH-F2 Language-Mediated Information Processing

Extracting important information from natural language processing article abstracts and visualizing them

*Masaki Murata, Makoto Kikui, Qing Ma, Toshiyuki Kanamaru, Hitoshi Isahara

Author information

Keywords: natural language processing, extracting important information, visualization

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

We studied ways of extracting important information from natural language processing article abstracts. While many kinds of information extraction, such as from newspapers, web materials, or biology article abstracts, already exist, no papers have been written about extracting natural language processing information. We hope that this study will be useful for natural language processing researchers. We defined nine categories that contain important expressions for natural language processing article abstracts. We constructed a method of extracting these expressions by using machine learning methods. Our method extracted these expressions with an F-measure of 0.67. When we considered partially correct expressions to be correct, the F-measure increased to 0.73. In particular, important expressions belonging to four categories (Accuracy, Field, Language, and Name) were automatically extracted at a high F-measure (about 0.8 to 0.9) using our method. We next constructed various kinds of visualization tools using important extracted expression. They can display the abstract of a paper highlighting extracted important expressions with color, display abstracts in a table including extracted inpotant expressions in each row, and make a graph indicating the distribution and trend of papers including important expressions for each category.

Corresponding author

Register with J-STAGE for free!