2017 Volume 29 Issue 1 Pages 527-532
While statistical machine translation methods have been developed by using parallel corpus, a technical issue of collecting large amounts of good quality parallel sentence pairs has been raised.With recursive learning, which yields quantification of differences between sentences of one language and sentences of the other language by a statistical machine translation using the parallel corpus, a novel method of parallel corpus revision (clean-up) is proposed in this paper.By applying edit numbers to the sentence difference quantification, we show experimental results of the clean-up using Japanese-English patent parallel corpus.