An Approach to NMT Re-Ranking Using Sequence-Labeling for Grammatical Error Correction

Bo Wang; Kaoru Hirota; Chang Liu; Yaping Dai; Zhiyang Jia

doi:10.20965/jaciii.2020.p0557

Regular Papers

Keywords: grammatical error correction, neural machine translation, transformer, sequence-labeling

Journal, open access

2020, Volume 24, Issue 4, pp. 557-567

DOI https://doi.org/10.20965/jaciii.2020.p0557

Abstract

An approach to N-best hypothesis re-ranking using a sequence-labeling model is applied to address the data deficiency problem in Grammatical Error Correction (GEC). Multiple candidate sentences are generated using a Neural Machine Translation (NMT) model; thereafter, these sentences are re-ranked via a stacked Transformer following a Bidirectional Long Short-Term Memory (BiLSTM) with Conditional Random Field (CRF). Correlations within the sentences are extracted using the sequence-labeling model based on the Transformer, which is particularly suitable for long sentences. Meanwhile, knowledge from a large amount of unlabeled data is acquired through the pre-trained structure. Thus, completely revised sentences are adopted instead of partially modified sentences. Experiments on the NUCLE and FCE datasets demonstrate that, compared with conventional NMT, the model improves the F_0.5 score by 8.22% and 2.09%, respectively. The proposed re-ranking method has the advantage of requiring only a small set of easily computed features that do not need linguistic inputs.
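
As a rough illustration of the pipeline summarized in the abstract, the sketch below re-ranks an N-best list by combining each candidate's NMT score with a score from a second model, and also computes the F_0.5 measure. It is a minimal sketch under stated assumptions, not the authors' implementation: the names Hypothesis, rerank, labeler_score, and weight are hypothetical, the linear interpolation of the two scores is an assumed stand-in for the paper's feature set, and the toy scorer is a lookup table in place of the Transformer/BiLSTM-CRF sequence labeler.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Hypothesis:
    text: str
    nmt_score: float  # assumed: log-probability assigned by the NMT model


def rerank(
    hypotheses: List[Hypothesis],
    labeler_score: Callable[[str], float],
    weight: float = 0.5,
) -> List[Hypothesis]:
    """Sort N-best hypotheses, best first, by an interpolated score.

    The interpolation is an illustrative assumption; the paper's actual
    re-ranking features are not specified in the abstract.
    """
    def combined(h: Hypothesis) -> float:
        return (1.0 - weight) * h.nmt_score + weight * labeler_score(h.text)

    return sorted(hypotheses, key=combined, reverse=True)


def f_beta(precision: float, recall: float, beta: float = 0.5) -> float:
    """Standard F-beta measure; beta = 0.5 weights precision above recall."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1.0 + b2) * precision * recall / (b2 * precision + recall)


# Toy N-best list: the NMT model slightly prefers the ungrammatical sentence.
n_best = [
    Hypothesis("He go to school .", -0.9),
    Hypothesis("He goes to school .", -1.1),
]

# Hypothetical stand-in for a sequence-labeling model's sentence score.
toy_scores = {"He go to school .": -2.0, "He goes to school .": -0.2}

ranked = rerank(n_best, lambda s: toy_scores[s])
print(ranked[0].text)
print(f_beta(0.6, 0.3))
```

In this toy case the second scorer penalizes the ungrammatical candidate strongly enough to reverse the NMT ordering, which is the effect re-ranking aims for. F_0.5 is commonly used in GEC evaluation because a wrong correction is usually considered worse than a missed one.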

Funding information

1. Funding agency / program name: Beijing Natural Science Foundation

2. Funding agency / program name: “Thousand Talents Plan” (the State Recruitment Program of Global Experts)