Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
General Paper
Spoken-Written Japanese Conversion for Japanese-English University-Lecture Translation
Ryota NakaoChenhui ChuSadao Kurohashi
Author information
JOURNAL FREE ACCESS

2021 Volume 28 Issue 4 Pages 1034-1052

Details
Abstract

In machine translation of spoken language, it is known that phenomena specific to spoken language have a negative impact on translation accuracy. Therefore, in this study, as a preprocessing step for Japanese-English translation in our university lecture translation system, we improve the translation accuracy by automatically converting spoken-style Japanese texts to written-style. First, we create a corpus consisting of Japanese transcriptions of university lectures, their conversions into written language, and the corresponding English texts. Next, we train spoken-written conversion models and Japanese-English translation models using the corpus. As a result, we show that spoken-written Japanese conversion improves the accuracy of Japanese-English translation. In addition, we quantify which phenomena affect translation accuracy and to what extent.

Content from these authors
© 2021 The Association for Natural Language Processing
Previous article Next article
feedback
Top