Journal of Information Processing
Online ISSN : 1882-6652
ISSN-L : 1882-6652
Geographical Entity Annotated Corpus of Japanese Microblogs
Koji MatsudaAkira SasakiNaoaki OkazakiKentaro Inui
著者情報
ジャーナル フリー

2017 年 25 巻 p. 121-130

詳細
抄録

This paper addresses the issues in the task of annotating geographical entities on microblogs and reports the preliminary results of our efforts to annotate Japanese microblog texts. Unlike prior work, we aim at annotating not only geographical location entities but also facility entities, such as stations, restaurants and schools. We discuss (i) how to build a gazetteer of geographical entities with a sufficiently broad coverage, (ii) what types ambiguities that need to be considered, (iii) why the annotator tends to disagree, and (iv) what technical problems should be addressed to automate the task of annotating the geographical entities. All the annotation data and the annotation guidelines are publicly available for research purposes from our web site.

著者関連情報
© 2017 by the Information Processing Society of Japan
前の記事 次の記事
feedback
Top