Journal of Information Processing
Online ISSN : 1882-6652
Geographical Entity Annotated Corpus of Japanese Microblogs
Koji MatsudaAkira SasakiNaoaki OkazakiKentaro Inui
Author information
JOURNALS FREE ACCESS

Volume 25 (2017) Pages 121-130

Details
Download PDF (4388K) Contact us
Abstract

This paper addresses the issues in the task of annotating geographical entities on microblogs and reports the preliminary results of our efforts to annotate Japanese microblog texts. Unlike prior work, we aim at annotating not only geographical location entities but also facility entities, such as stations, restaurants and schools. We discuss (i) how to build a gazetteer of geographical entities with a sufficiently broad coverage, (ii) what types ambiguities that need to be considered, (iii) why the annotator tends to disagree, and (iv) what technical problems should be addressed to automate the task of annotating the geographical entities. All the annotation data and the annotation guidelines are publicly available for research purposes from our web site.

Information related to the author
© 2017 by the Information Processing Society of Japan
Previous article Next article

Recently visited articles
feedback
Top