JSAI Technical Report, SIG-SLUD
Online ISSN : 2436-4576
Print ISSN : 0918-5682
103rd (Mar.2025)
Conference information

Addressing ASR Challenges for Stuttered Speech Through a Speech Translation Framework
Natsumi KUBOTASakti SAKRIANI
Author information
CONFERENCE PROCEEDINGS RESTRICTED ACCESS

Pages 214-217

Details
Abstract

Stuttered speech presents significant challenges for automatic speech recognition (ASR) due to its irregular patterns and the scarcity of annotated data. This limitation hinders the development of robust systems capable of accurately recognizing and processing stuttered speech. To address these issues, this study propose a novel approach that leverages text-to-speech (TTS) technology for data augmentation, enabling the synthesis of realistic stuttered speech to supplement existing datasets. Using this augmented data, this study develop an ASR system within a speech translation framework designed to transform stuttered speech into fluent text.

Content from these authors
© 2025 The Japaense Society for Artificial Intelligence
Previous article Next article
feedback
Top