構文的曖昧性を持つ英語固有表現とその対訳表現の獲得

吉見 毅彦; 九津見 毅; 小谷 克則; 佐田 いち子; 井佐原 均

doi:10.5715/jnlp.12.5_91

Abstract

This paper proposes a method of extracting a bilingual pair of a syntactically am-biguous named entity and its counterpart from a sentence-aligned English-Japanese parallel corpus.This method computes the degree of semantic and phonetic similar-ities between an English named entity and its translation candidate, and calculates the overall score of the pair as the weighted sum of the two kinds of scores. It avoids extracting English named entities with wrong prepositional phrase attach-ment and/or wrong scope of coordination. In an experiment using a parallel corpus of Yomiuri Shimbun and The Daily Yomiuri, the proposed method has achieved the F-value of 0.678, which surpasses 0.583 marked by a baseline method.

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Register with J-STAGE for free!