音声認識結果と誤読候補リストを用いた読み間違い検出

齊藤 新; 松崎 拓也

doi:10.11517/pjsai.JSAI2024.0_1D3GS704

Abstract

We have developed a method for detecting reading errors in Japanese speech data. First, speech recognition is performed to transcribe a speech to the form of a phoneme sequence, and then it is checked whether it includes reading errors. In order to distinguish between errors in speech recognition and actual reading errors, we create a candidate list of reading errors for each morpheme, select the one with the smallest edit distance from the speech recognition result among the correct answer and the candidate reading errors, and detect it as a reading error if it is different from the correct reading. We conducted experiments on speech data in the LaboroTVspeech corpus and the Japanese Spoken Language Corpus, as well as synthetic speech. The results confirmed that the method is effective when the speech actually contains reading errors, although there were many cases in which reading errors were mis-detected even when the correct reading was made. In particular, in experiments with synthesized speech, the method was able to accurately detect misreading in 80.0% of the cases, including how a word was mispronunciated, and succeeded in detecting 98.6% of wrongly pronunciated morphemes.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!