音声処理単位の自動獲得と音声認識・符号化への応用

中川 聖一; 斎藤 稔; 升方 幹雄

doi:10.11517/jjsai.13.4_619

抄録

In this paper, we propose a method to acquire speech procesing units automatically that are suitable for automatic processing, not based on prior units like syllables or phonemes. In addition, while the units are acquired automatically, the acquisition process of concept of speech unit on human is taken into consideration, so the restriction such that "speech utterances to the same word are represented by the same unit sequence" is added. We acquired speech units using procudures of template matching, applied these units to speech recognition and found that it needs to take into consideration the variation of speech patterns. Therefore, we modelled the speech patterns corresponding to acquired units by using HMMs and higher recognition rates were obtained by training of HMMs with these units than those by the original template units. Therefore, a method to acquire units based on ergodic HMMs from an initial step was investigated in this paper. When we evaluated these units by word recognition experiments, a high recognition rate 99.5% at 216 words was obtained, with consideration of the above restriction at the both steps of acquisition process of units and registration of word dictionary (on the conditions of phoneme-like duration and 64 numbers of units). Finally, we compared the acquired units with phonetic units and found that the former is better than the latter on spoken word recognition.

著者関連情報

お気に入り & アラート

閲覧履歴

発行機関からのお知らせ

PDF閲覧時に認証を求められる記事がございます（発行後2年間）が，人工知能学会の個人会員は無料で閲覧可能です．認証のための購読者番号やパスワードは会員マイページ（ユース会員の場合はジュニア・ユース会員サイト）にログインし「お知らせ」にてご確認下さい（会員情報管理システムとオンラインで連携していないため，パスワードは同システムとは異なります．また，認証情報の更新は偶数月の月末に実施しております．新規入会された方は利用できるまでしばらくお待ちください）．個人会員以外は記事複製申込フォームから購入いただけます．また，アマゾンにて冊子版あるいはKindle版を購入いただくことも可能です．

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）