主催: 人工知能学会
会議名: 第103回言語・音声理解と対話処理研究会
回次: 103
開催地: 早稲田大学 40号館 グリーン・コンピューティング・システム研究開発センター
開催日: 2025/03/20 - 2025/03/22
p. 214-217
Stuttered speech presents significant challenges for automatic speech recognition (ASR) due to its irregular patterns and the scarcity of annotated data. This limitation hinders the development of robust systems capable of accurately recognizing and processing stuttered speech. To address these issues, this study propose a novel approach that leverages text-to-speech (TTS) technology for data augmentation, enabling the synthesis of realistic stuttered speech to supplement existing datasets. Using this augmented data, this study develop an ASR system within a speech translation framework designed to transform stuttered speech into fluent text.