外国語教育メディア学会関東支部研究紀要
Online ISSN : 2432-3071
Print ISSN : 2432-3063
研究論文
Automated Measures of Oral Fluency and Pronunciation with Automatic Speech Recognition:
Evaluating the Measures and Creating a Combined Rating Model
SPRING Ryan
著者情報
ジャーナル フリー HTML

2025 年 10 巻 p. 1-25

詳細
抄録

Though automatic speech recognition (ASR) can be easily used to create web-based speaking tools, there is a need to create new measures from the generated ASR transcript and evaluate how well these measures correlate to human rater scoring. This study utilized 61 speaking test audio file samples taken from a tested read-aloud task performed by Japanese EFL learners. Six human raters judge their pronunciation and fluency. ASR transcripts were obtained and transformed into a number of measures of fluency and pronunciation. Raw correlation and performance in regression models were used to evaluate the measures, and a scoring model was created to match raters’ amalgamated scores on a 1–5 scale. I found that the time to complete the task (T), the number of extra words in the ASR transcript (extraW), speech rate (SR) were meaningful measures of fluency. I also found that though penalized pronunciation score (penP) is a meaningful measure of pronunciation, fine-grained measures based on the inclusion of phrases were equally meaningful. Finally, several of these measures were able to be combined into a scoring model that showed 100% accuracy in predicting the original 61 audio files. However, it is unknown how well it will score new datasets.

著者関連情報
© 2025 The Japan Association for Language Education and Technology, Kanto Chapter
次の記事
feedback
Top