Journal of Japan Society for Fuzzy Theory and Intelligent Informatics
Online ISSN : 1881-7203
Print ISSN : 1347-7986
ISSN-L : 1347-7986
Short Notes
Parallel Corpus Clean-up Based on Recursive Learning
Tsutomu MATSUNAGADaisuke SATOMasami HARA
Author information
JOURNAL OPEN ACCESS

2017 Volume 29 Issue 1 Pages 527-532

Details
Abstract

While statistical machine translation methods have been developed by using parallel corpus, a technical issue of collecting large amounts of good quality parallel sentence pairs has been raised.With recursive learning, which yields quantification of differences between sentences of one language and sentences of the other language by a statistical machine translation using the parallel corpus, a novel method of parallel corpus revision (clean-up) is proposed in this paper.By applying edit numbers to the sentence difference quantification, we show experimental results of the clean-up using Japanese-English patent parallel corpus.

Content from these authors
© 2017 Japan Society for Fuzzy Theory and Intelligent Informatics
Previous article
feedback
Top