Transactions of the Japanese Society for Artificial Intelligence
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Technical Papers
A Method for Extracting Important Segments from Documents Using Support Vector Machines
Toward Automatic Text Summarization
Daisuke Suzuki, Akira Utsumi

2006 Volume 21 Issue 4 Pages 330-339

Abstract
In this paper we propose an extraction-based method for automatic summarization. The method consists of two processes: important segment extraction and sentence compaction. Important segment extraction uses Support Vector Machines (SVMs) to classify each segment in a document as important or unimportant. Sentence compaction then determines which portions of a sentence are grammatically appropriate for a summary, based on the sentence's dependency structure and the SVM classification results. To test the performance of our method, we conducted an evaluation experiment using the Text Summarization Challenge (TSC-1) corpus of human-prepared summaries. Our method outperformed both a segment-extraction-only method and the Lead method, especially on sentences that were only partly included in the human summaries. Further analysis of the experimental results suggests that a hybrid method integrating sentence extraction with segment extraction may generate better summaries.
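The interaction between segment classification and sentence compaction can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not state the compaction rule, so the sketch assumes one plausible reading, namely that a segment is kept if it is classified as important or if an important segment depends on it, directly or transitively, so that the shortened sentence stays connected. The function name `compact_sentence`, the toy sentence, and the hand-set `important` flags (standing in for SVM decisions) are all hypothetical.

```python
from typing import List, Optional


def compact_sentence(
    segments: List[str],
    heads: List[Optional[int]],
    important: List[bool],
) -> List[str]:
    """Keep important segments plus every segment on their head chain to the root.

    heads[i] is the index of the segment that segment i depends on,
    or None for the root. This keep-the-ancestors rule is an assumption,
    not a rule taken from the paper.
    """
    keep = set()
    for i, flag in enumerate(important):
        node: Optional[int] = i if flag else None
        # Walk up the dependency chain; stop at the root or at a kept segment.
        while node is not None and node not in keep:
            keep.add(node)
            node = heads[node]
    # Emit the kept segments in their original order.
    return [segments[i] for i in sorted(keep)]


# Toy sentence segmented by hand (hypothetical data, not from the TSC-1 corpus).
segments = ["yesterday", "the committee", "quickly", "approved", "the budget"]
heads = [3, 3, 3, None, 3]  # every segment depends on "approved", the root
important = [False, True, False, False, True]  # stand-in for SVM decisions

summary = compact_sentence(segments, heads, important)
print(" ".join(summary))  # the committee approved the budget
```

Under this assumed rule, "approved" is retained even though it was not flagged as important, because both flagged segments depend on it. A pure segment-extraction method would drop it and leave an ungrammatical fragment.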
© 2006 JSAI (The Japanese Society for Artificial Intelligence)