Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Paper
Temporal Information Annotation on ‘Balanced Corpus of Contemporary Written Japanese’
Hikari KonishiMasayuki AsaharaKikuo Maekawa
Author information
JOURNAL FREE ACCESS

2013 Volume 20 Issue 2 Pages 201-221

Details
Abstract
Temporal information is important for grounding event expressions on a timeline. Temporal expression extraction has been performed as numerical representation extraction, which is a subtask of named entity extraction. For English texts, evaluation workshops were held in which temporal expressions were extracted and normalized. An annotation schema, TimeML, was designed to annotate events and temporal expressions, and several annotated corpora of newswire texts were developed. However, a schema for temporal information and normalization of Japanese texts has not been designed. This paper proposes an annotation schema, which is based on TimeML, for Japanese temporal information. We annotate the temporal information in parts of the ‘Balanced Corpus of Contemporary Written Japanese’. We identify several problems in the annotation and discuss the steps to be taken to ground Japanese event expressions on a timeline.
Content from these authors
© 2013 The Association for Natural Language Processing
Previous article Next article
feedback
Top