人工知能学会研究会資料 言語・音声理解と対話処理研究会
Online ISSN : 2436-4576
Print ISSN : 0918-5682
103回(2025/3)
会議情報

音声翻訳フレームワークによる吃音音声の自動音声認識に対する課題への取り組み
久保田 なつみサクティ サクリアニ
著者情報
会議録・要旨集 認証あり

p. 214-217

詳細
抄録

Stuttered speech presents significant challenges for automatic speech recognition (ASR) due to its irregular patterns and the scarcity of annotated data. This limitation hinders the development of robust systems capable of accurately recognizing and processing stuttered speech. To address these issues, this study propose a novel approach that leverages text-to-speech (TTS) technology for data augmentation, enabling the synthesis of realistic stuttered speech to supplement existing datasets. Using this augmented data, this study develop an ASR system within a speech translation framework designed to transform stuttered speech into fluent text.

著者関連情報
© 2025 人工知能学会
前の記事 次の記事
feedback
Top