Improving Accuracy through the Interaction of Dependency Structure Analysis and Sentence Boundary Detection in Spoken Japanese

下岡 和也; 内元 清貴; 河原 達也; 井佐原 均

doi:10.5715/jnlp.12.3_3

Abstract

This paper describes methods for detecting dependencies between Japanese phrasal units called bunsetsus, and for detecting sentence boundaries, in a spontaneous speech corpus. In spontaneous monologues, the biggest problem for dependency structure analysis is that sentence boundaries are ambiguous. We propose two methods for improving the accuracy of sentence boundary detection in spontaneous Japanese speech: one is based on statistical machine translation using dependency information, and the other is based on text chunking using a support vector machine (SVM). The proposed methods achieved an F-measure of 84.9 for sentence boundary detection. Using automatically detected sentence boundaries also improved the accuracy of dependency structure analysis from 75.2% to 77.2%. Furthermore, both dependency structure analysis and sentence boundary detection became more accurate when each task used the results of the other.
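The abstract reports sentence boundary detection quality as an F-measure but does not spell out how it is computed. The sketch below assumes the usual definition, the harmonic mean of precision and recall over boundary positions, scaled to 0-100. The function name `boundary_f_measure` and the toy boundary indices are hypothetical and are not taken from the paper.

```python
def boundary_f_measure(gold, predicted):
    """Return the F-measure (0-100) of predicted sentence boundaries.

    Assumption: a boundary is identified by the index of the bunsetsu
    it follows, and a prediction counts as correct only on an exact match.
    """
    gold, predicted = set(gold), set(predicted)
    correct = len(gold & predicted)
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    # Harmonic mean of precision and recall, scaled to a percentage.
    return 100 * 2 * precision * recall / (precision + recall)


# Toy data (hypothetical): three of five predictions match the four gold boundaries.
gold = [3, 7, 12, 18]
predicted = [3, 7, 13, 18, 21]
score = boundary_f_measure(gold, predicted)
print(round(score, 1))  # precision 0.60, recall 0.75 -> 66.7
```

Under this definition, a score of 84.9 reflects precision and recall combined into one figure, so it cannot be read directly as the percentage of boundaries found.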
