Journal of Natural Language Processing (自然言語処理)
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
General Paper
Multi-dialect Neural Machine Translation for 48 Low-resource Japanese Dialects
Kaori Abe, Yuichiroh Matsubayashi, Naoaki Okazaki, Kentaro Inui

2020, Volume 27, Issue 4, pp. 781-800

Abstract

We present a multi-dialect neural machine translation (NMT) model tailored to Japanese. Although the surface forms of Japanese dialects differ from those of standard Japanese, most dialects share fundamental properties, such as word order, and some also share many of the same phonetic correspondence rules. To take advantage of these properties, we integrate multilingual, syllable-level, and fixed-order translation techniques into a general NMT model. Our experimental results demonstrate that this model can outperform a baseline dialect translation model. In addition, we show that visualizing the dialect embeddings learned by the model can facilitate geographical and typological analyses of the dialects.

© 2020 The Association for Natural Language Processing