音声研究
Online ISSN : 2189-5961
Print ISSN : 1342-8675
特集「音声研究関連データベースの動向」
日本語CallHomeコーパス(<特集>音声研究関連データベースの動向)
伝 康晴フライ ジョン
著者情報
ジャーナル フリー

2000 年 4 巻 2 号 p. 24-30

詳細
抄録
This article describes the CallHome Japanese (CHJ) corpus and a project for annotating this corpus with various sorts of linguistic tags. The CHJ corpus is a collection of digitized speech data and text transcriptions of 120 spontaneous, unscripted telephone conversations. The annotation of the corpus provides word segmentations, part-of-speech tags and alignment with the speech for all words, semantic classes for nouns and verbs, and argument structures for verbs. A large scale, high quality corpus of naturally occurring conversations with such extensive linguistic annotations will provide a basis for scientific and technological investigation into human speech communication.
著者関連情報
© 2000 日本音声学会
前の記事 次の記事
feedback
Top