Journal of Signal Processing
Online ISSN : 1880-1013
Print ISSN : 1342-6230
ISSN-L : 1342-6230
Deep-Learning-Based Speech Emotion Recognition Using Synthetic Bone-Conducted Speech
Md. Sarwar HosainYosuke SugiuraNozomiko YasuiTetsuya Shimamura
著者情報
ジャーナル フリー

2023 年 27 巻 6 号 p. 151-163

詳細
抄録

Speech emotion recognition has drawn extensive attention in recent years. We propose deep learning (DL)-based speech emotion recognition using synthetic bone-conducted (BC) speech. In our proposed model, air-conducted(AC) speech is transformed to BC speech using an infinite impulse response (IIR) filter. Data augmentation techniques are utilized and the parameters of convolutional neural network (CNN) models are modified to enhance the accuracy of the proposed model. Simulation results demonstrate that the proposed model outperforms the existing models in terms of recognition accuracy for BC speech. The accuracy of the proposed model is 72.50% for BC speech, whereas that of the existing model is 69.83% for AC speech. This is because BC speech can enhance low-frequency components, which is important for recognizing emotions.

著者関連情報
© 2023 Research Institute of Signal Processing, Japan
次の記事
feedback
Top