Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Paper
Annotation of Web Documents for Automatic Summarization to Verify Information Credibility
Hideyuki Shibuki, Masahiro Nakano, Rintaro Miyazaki, Madoka Ishioroshi, Koichi Kaneko, Takahiro Nagai, Tatsunori Mori

2014 Volume 21 Issue 2 Pages 157-212

Abstract
Over a span of three years, we constructed and refined four corpora that serve as the basis for generating summaries for verifying information credibility. Such a summary is a brief document composed of extracts from Web documents; it gives the user material for judging the validity of a statement. In this paper, we describe a set of tags designed for observing annotation and preparing a gold standard for the summary, as well as the annotation method. Because it is difficult to examine each Web document for its suitability as a contribution to the summary, we also describe our method for obtaining appropriate documents. Finally, we share our observations and the lessons learned while constructing these corpora.
© 2014 The Association for Natural Language Processing