Journal of Information Processing
Online ISSN : 1882-6652
ISSN-L : 1882-6652
A Survey of Domain Adaptation for Machine Translation
Chenhui ChuRui Wang
Author information
JOURNAL FREE ACCESS

2020 Volume 28 Pages 413-426

Details
Abstract

Neural machine translation (NMT) is a deep learning based approach for machine translation, which outperforms traditional statistical machine translation (SMT) and yields the state-of-the-art translation performance in scenarios where large-scale parallel corpora are available. Although a high-quality and domain-specific translation is crucial in the real world, domain-specific corpora are usually scarce or nonexistent, and thus vanilla NMT performs poorly in such scenarios. Domain adaptation that leverages both out-of-domain parallel corpora as well as monolingual corpora for in-domain translation, is very important for domain-specific translation. In this paper, we give a comprehensive survey of the state-of-the-art domain adaptation techniques for MT. Because of the current dominance of NMT in MT research, we give a brief review of domain adaptation for SMT, but put most of our effort into the survey of domain adaptation for NMT. We hope that this paper will be both a starting point and a source of new ideas for researchers and engineers who are interested in domain adaptation for MT.

Content from these authors
© 2020 by the Information Processing Society of Japan
Previous article Next article
feedback
Top