Toponym Resolution Based on Surface Difference among Candidates

Tomohisa Sano, Shiho Hoshi Nobesawa, Hiroyuki Okamoto, Hiroya Suzuki, Masaki Matsubara, Hiroaki Saito

doi:10.5715/jnlp.17.1_29


Keywords: Toponym Resolution, Language Identification, Proper Noun Recognition, Multilingual Processing


2010 Volume 17 Issue 1 Pages 1_29-1_54


Abstract

Toponym resolution has been widely studied as an approach to the unknown word problem. In this paper we propose a method that estimates area candidates for toponyms, so that area information can be assigned to unknown toponyms. Our aim is to extend the target toponyms to unrestricted domains, so we aim for a simple system that avoids gazetteers and context information. The method relies only on surface information to estimate the candidate areas to which a toponym may belong. Toponym resolution can be difficult for linguistic or geographic reasons. Focusing on the surface differences among probable countries, we constructed a system with a reduction phase, which performs a rough examination, and a selection phase, which performs a detailed examination of the remaining candidates. By combining these two phases effectively, we achieved high precision while maintaining high recall.
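
The two-phase idea can be illustrated with a minimal sketch. The abstract does not specify the features or models used, so everything here is an assumption made for illustration: the toy training list `TRAIN`, the use of character bigrams for the rough reduction phase and character trigrams for the finer selection phase, add-one smoothing, and the hypothetical function names `reduce_candidates`, `select_country`, and `resolve`. It is not the authors' implementation.

```python
import math
from collections import Counter

# Hypothetical toy training data: country code -> known toponyms.
TRAIN = {
    "JP": ["tokyo", "osaka", "nagoya", "yokohama", "kawasaki", "fukuoka"],
    "DE": ["berlin", "hamburg", "stuttgart", "dortmund", "nurnberg", "freiburg"],
    "IT": ["roma", "milano", "napoli", "torino", "verona", "bologna"],
}

def ngrams(word, n):
    """Character n-grams of a word, padded with start (^) and end ($) marks."""
    padded = "^" * (n - 1) + word.lower() + "$"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]

def build_models(n):
    """Count character n-grams per country."""
    return {c: Counter(g for w in ws for g in ngrams(w, n))
            for c, ws in TRAIN.items()}

def avg_log_prob(word, counts, n, vocab_size):
    """Average add-one-smoothed log-probability of the word's n-grams."""
    total = sum(counts.values())
    grams = ngrams(word, n)
    return sum(math.log((counts[g] + 1) / (total + vocab_size))
               for g in grams) / len(grams)

def rank(word, n, countries):
    """Rank the given countries by how well their n-gram model fits the word."""
    models = build_models(n)
    vocab_size = len(set().union(*models.values()))
    scored = [(avg_log_prob(word, models[c], n, vocab_size), c)
              for c in countries]
    return [c for _, c in sorted(scored, reverse=True)]

def reduce_candidates(word, k=2):
    """Reduction phase (assumed): rough bigram ranking, keep the top-k countries."""
    return rank(word, 2, list(TRAIN))[:k]

def select_country(word, candidates):
    """Selection phase (assumed): finer trigram comparison among survivors only."""
    return rank(word, 3, candidates)[0]

def resolve(word, k=2):
    """Run both phases using surface information only (no gazetteer, no context)."""
    return select_country(word, reduce_candidates(word, k))

print(resolve("yokohama"))
```

Keeping the reduction phase permissive (a larger `k`) favors recall, while the selection phase, which compares only the surviving candidates, is where precision is gained. This mirrors the trade-off the abstract describes.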
