人工知能学会全国大会論文集
Online ISSN : 2758-7347
27th (2013)
セッションID: 4B1-3
会議情報

A Corpus for Studies on Scientific Writing Assistance
*Ngan Nguyen宮尾 祐介
著者情報
会議録・要旨集 フリー

詳細
抄録

Along with the increasing number of non-native speakers of English, the demand for writing assistance applications, including the automatic proofreading application for advanced learners, is raising new challenges for natural language processing (NLP). Previous research on writing assistance has mostly focused on correcting spelling errors and grammatical errors. However, the proofreading process, which is required by the advanced learners, is not only to correct grammatical errors, but also to paraphrase a sentence, when necessary, to make it become more fluent and less awkward. To satisfy such requirements, this work aims at constructing a corpus to support research on writing assistance techniques for advanced English learners. Our corpus is a collection of written work of non-native researchers which has been proofread by a English native speakers. A new annotation scheme was then used to capture both the spelling/grammatical error corrections and the paraphrases made by English native proofreaders. The resulting corpus contains 3485 pairs of original and revised sentences, of which, 2516 pairs contain grammatical and/or paraphrase corrections.

著者関連情報
© 2013 The Japanese Society for Artificial Intelligence
前の記事 次の記事
feedback
Top