Signal Processing of Speech Using Independent Component Analysis Based on Information Maximization Algorithm

Manabu KOTANI; Satoshi MAEKAWA; Seiichi OZAWA; Kenzo AKAZAWA

doi:10.9746/sicetr1965.36.456

Manabu KOTANI, Satoshi MAEKAWA, Seiichi OZAWA, Kenzo AKAZAWA

Author information

Keywords: independent component analysis, speech, feature extraction, neural networks

JOURNAL FREE ACCESS

2000 Volume 36 Issue 5 Pages 456-458

DOI https://doi.org/10.9746/sicetr1965.36.456

Details

Abstract

We describe what characteristics an independent component analysis can extract from Japanese continuous speech. Speech data was selected from ATR database uttered by a female speaker. The data was recorded at 20kHz sampling frequency and was pre-processed with a whitening filter. The learning algorithm of a network was an information-maximization approach proposed by Bell and Sejnowski. After the learning, most of the basis functions that are columns of a mixing matrix were localized in both time and frequency. Furthermore, we confirmed that there were some basis functions to extract the acoustic feature such as the pitch and the formant of each vowel.

Corresponding author

Register with J-STAGE for free!