感情の複数クラス分割による自然発話音声からの感情の強弱認識に関する検討

高橋 誠治; 矢野 良和; 道木 慎二; 大熊 繁

doi:10.14864/fss.24.0.12.0

Abstract

This paper proposes a method for emotion recognition in unintentional speech. Speech data labeled to certain one consists of several kind of emotion utterance. Even though speech data is assigned to same label, corresponding prosodic features are dissimilar with each other. So, it is hard to distinguish one class to the others. We assume the reason that several subclasses are distributed and overlapped in the feature space. In this paper, we propose the technique to improve recognition rate by detecting multiple hidden series from emotional speech. Emotional speech data with the same label are divided into multiple hidden subclasses according to prosodic feature by k-means. A set of similar hidden subclasses with various label are grouped. Grouped speech data in one hidden emotion category train one SVM, so that several SVMs for several hidden emotion categories. Experimental results show that proposed technique raised recognition rate better than the traditional one.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!