主催: Japan Society of Kansei Engineering
会議名: The 8th International Symposium on Affective Science and Engineering
回次: 8
開催地: Online Academic Symposium
開催日: 2022/03/27
In general, learning English is difficult for non-native speakers because of the differences in vowels and consonants. There are some ways to practice English pronunciation such as shadowing, however, if the audio’s voice features greatly differ from the learner’s voice, it might impede learning and sound reproduction. In order to solve this problem, we propose a method to make the pronunciation data of the model pronunciation resemble the learner’s own voice by using UTAU and Interactive Evolutionary Computation. As a result of the experiments, we found that this method was capable of searching for highly evaluated solutions. The Wilcoxon signed-rank test was used to examine the statistical difference between the evaluations of the initial and final generations, and a significant difference was observed at P<0.01. Regarding to the pitch parameters, we could find different tendencies between males and females. This means the parameters were actually making the voice similar to examinee’s voice. However, there were some problems, such as the parameters that did not work well, the UTAU voice quality, the lack of female examinees, and so on. We plan to eliminate or at least reduce the effects from those problems in future experiments and make a better system for English learners so that they can learn more efficiently.