場所参照表現抽出における言語モデルの時代横断型評価

片山 歩希; 東山 翔平; 大内 啓樹; 坂井 優介; 竹内 綾乃; 坂東 諒; 橋本 雄太; 小木曽 智信; 渡辺 太郎

doi:10.5715/jnlp.32.1103

General Paper (Peer-Reviewed)

Cross-Era Evaluation of Language Models for Location Referring Expression Extraction

Ayuki Katayama, Shohei Higashiyama, Hiroki Ouchi, Yusuke Sakai, Ayano Takeuchi, Ryo Bando, Yuta Hashimoto, Toshinobu Ogiso, Taro Watanabe

Author information

Keywords: Geographic Text Analysis, Location Referring Expression Extraction, Named Entity Recognition, Historical Japanese Text, Language Model

JOURNAL FREE ACCESS

2025 Volume 32 Issue 4 Pages 1103-1128

DOI https://doi.org/10.5715/jnlp.32.1103

Details

Abstract

Automatic extraction of location referring expressions (LREs) can facilitate humanities research by enabling the analysis of large collections of historical texts. In this study, we constructed LRE annotation datasets from early modern and modern travelogues. We then evaluated the performance of Transformer-based contemporary language models in extracting LREs from historical texts by combining these datasets with existing datasets of modern disaster records and contemporary travelogues. Our experiments demonstrated the effectiveness of leveraging contemporary annotated data for LRE extraction from historical texts. However, whereas extraction accuracy on contemporary texts was high (maximum F1 score of 0.890), accuracy on historical texts remained low to moderate (maximum F1 scores of 0.506–0.739), indicating that further model enhancements are needed to better adapt contemporary language models to historical text.

Corresponding author

Register with J-STAGE for free!