Speech Emotion Recognition Based on Parametric Filter and Fractal Dimension

Xia MAO; Lijiang CHEN

doi:10.1587/transinf.E93.D.2324

Abstract

In this paper, we propose a new method that employs two novel features, correlation density (C_d) and fractal dimension (F_d), to recognize emotional states contained in speech. The former feature obtained by a list of parametric filters reflects the broad frequency components and the fine structure of lower frequency components, contributed by unvoiced phones and voiced phones, respectively; the latter feature indicates the non-linearity and self-similarity of a speech signal. Comparative experiments based on Hidden Markov Model and K Nearest Neighbor methods are carried out. The results show that C_d and F_d are much more closely related with emotional expression than the features commonly used.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!