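Design of Function Test Items for Japanese-to-English Machine Translation Focusing on Differences in Linguistic Expression Systems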

Satoru Ikehara (池原 悟); Satoshi Shirai (白井 諭); Kentaro Ogura (小倉 健太郎)

doi:10.11517/jjsai.9.4_569

Abstract

This paper describes the design of a set of 6200 Japanese-English sentence pairs for testing Japanese-to-English MT systems. Japanese expressions are organized into 600 test items that take the characteristics of both Japanese and English into account, and an average of 10 sentence pairs was then prepared for each item. In machine translation, translating between two very different languages (e.g. Japanese and English) is more difficult than translating between two similar languages (e.g. Japanese and Korean). This is believed to be due to differences in morphology and in how things are conceived in different language groups. We have therefore extended our attention beyond morphological differences to cover differences in perception and concepts. The process of constructing Japanese texts is examined at 5 levels (part of speech, phrase, expression, sentence, and text). Based on these 5 levels and the differences between the two languages, about 600 test items were chosen to test the basic functions of Japanese-to-English machine translation systems. Finally, examples of these test items were extracted from various documents and publications and combined with specially constructed sentences to make the test set of 6200 sentence pairs.
