Design and Construction of a Simultaneous Interpreting Corpus

松原 茂樹; 相澤 靖之; 河口 信夫; 外山 勝彦; 稲垣 康善

doi:10.50837/istk.0108

Abstract

This paper describes a large-scale spoken language corpus of simultaneous interpreting constructed at the Center for Integrated Acoustic Information Research (CIAIR), Nagoya University. Among other things, the corpus has the following characteristics: (1) English and Japanese speech is recorded in parallel, (2) the data contain both monologue and dialogue speech, and (3) exact start and end times are provided for each utterance. We have collected about 65 hours of speech data in total and transcribed them into ASCII text files (about 367,000 morphemes in 22,000 utterance units). This paper also outlines the software tools that we have developed for investigating the corpus. The corpus will be made publicly available in the near future.

Succeeding journal

通訳翻訳研究
