Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 2C1-GS-7-01
Conference information

Toyama Dialect Recognition and Conversion to Standard Japanese via Deep Learning
*Yuka HORIMOTOItsugun CHOHiroaki SAITO
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Motivated by a deep affection for Toyama, the study focuses on speech recognition of the Toyama dialect. Despite an appreciation for its unique language style, it poses communication challenges with individuals from different regions. Therefore, this study aims to develop a system that converts the Toyama dialect into standard Japanese by speech recognition, facilitating communication for visitors from other areas. We employed wav2vec 2.0 for the speech recognition model and used two GPT-2 models for standard Japanese conversion model. We created a Toyama dialect corpus and enhanced its quality via meticulous transcription. All speech data underwent smoothing via RMS and data augmentation through masking during the training. In the experiments, we employed CER and WER as automatic evaluations, and the human evaluations focused on semantic equivalence and grammaticality. Empirical studies demonstrate that our model outperformed the baseline, and the effectiveness of our approach is verified in the discussion.

Content from these authors
© 2024 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top