Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Paper
Detection of Quotations and Inserted Clauses and its Application to Dependency Structure Analysis in Spontaneous Japanese
Ryoji HamabeKiyotaka UchimotoTatsuya KawaharaHitoshi Isahara
Author information
JOURNAL FREE ACCESS

2009 Volume 16 Issue 1 Pages 1_3-1_23

Details
Abstract
Japanese dependency structure is usually represented by relationships between phrasal units called bunsetsus. One of the biggest problems with dependency structure analysis in spontaneous speech is that clause boundaries are ambiguous. This paper describes a method for detecting the boundaries of quotations and inserted clauses and that for improving the dependency accuracy by applying the detected boundaries to dependency structure analysis. The quotations and inserted clauses are determined by using an SVM-based text chunking method that considers information on morphemes, pauses, etc. The information on automatically analyzed dependency structure is also used to detect the beginning of the clauses. Our evaluation experiment using Corpus of Spontaneous Japanese (CSJ) showed that the automatically estimated boundaries of quotations and inserted clauses helped to improve the accuracy of dependency structure analysis from 77.7% to 78.7% .
Content from these authors
© 2009 The Association for Natural Language Processing
Previous article Next article
feedback
Top