Abstract
Important sentence extraction has been studied extensively. Recently, the readability of summaries has been drawing the attention of researchers who aim to create informative summaries. To keep a summary coherent, we focus on associative relations between subjects. In this paper, we propose a method that produces an easy-to-read summary by maximizing the sum of the subject-flows in it. First, our system divides the sentences into paragraphs at segmentation points where the flow is weak, and constructs a multi-layer paragraph tree structure. Then, by analyzing the subject-flow, the system finds introductory and conclusive paragraphs. Finally, the system identifies dispensable paragraphs using a threshold and an estimate of their contribution to the surrounding subject-flows. The system automatically adjusts the threshold to minimize the error in the compression rate. As a result, we obtain a readable summary with strong associative coherence. In an experiment on newspaper editorial articles, we confirmed that our system produces more readable summaries than a baseline method, and that 77.5% of the summaries with a compression rate of 30% keep the conclusion of the original article.
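
The threshold adjustment mentioned above can be illustrated with a minimal Python sketch. It is not the system's actual procedure. It assumes that each paragraph already carries a contribution score (a hypothetical input here, standing in for the estimated contribution to surrounding subject-flows), that the compression rate is the summary length divided by the original length in characters, and that a simple exhaustive search over candidate thresholds is acceptable. The names `summarize`, `compression_rate`, and `tune_threshold` are illustrative only.

```python
from typing import List, Tuple

# A paragraph is (text, contribution_score). The score is a hypothetical
# stand-in for the estimated contribution to surrounding subject-flows.
Paragraph = Tuple[str, float]


def summarize(paragraphs: List[Paragraph], threshold: float) -> List[str]:
    """Keep paragraphs whose score reaches the threshold; drop the rest."""
    return [text for text, score in paragraphs if score >= threshold]


def compression_rate(kept: List[str], paragraphs: List[Paragraph]) -> float:
    """Assumed definition: summary length / original length, in characters."""
    total = sum(len(text) for text, _ in paragraphs)
    if total == 0:
        return 0.0
    return sum(len(text) for text in kept) / total


def tune_threshold(paragraphs: List[Paragraph], target_rate: float) -> float:
    """Pick the threshold whose summary is closest to the target rate."""
    scores = sorted({score for _, score in paragraphs})
    if not scores:
        return 0.0
    # Each distinct score is a candidate, plus one value above the maximum
    # so that the empty summary is also considered.
    candidates = scores + [scores[-1] + 1.0]

    def error(threshold: float) -> float:
        kept = summarize(paragraphs, threshold)
        return abs(compression_rate(kept, paragraphs) - target_rate)

    return min(candidates, key=error)


if __name__ == "__main__":
    doc = [("a" * 10, 0.9), ("b" * 10, 0.1), ("c" * 10, 0.5), ("d" * 10, 0.3)]
    best = tune_threshold(doc, target_rate=0.5)
    print(best, summarize(doc, best))
```

Because only the distinct scores can change which paragraphs are kept, searching over them covers every achievable summary under this simplified model. A real system would also need to protect the introductory and conclusive paragraphs from removal, which this sketch does not attempt.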