ITE Technical Report
Online ISSN : 2424-1970
Print ISSN : 1342-6893
ISSN-L : 1342-6893
Volume : 32.46
Session ID : ME2008-170
Low Complexity Speaker Identification in AAC Domain
Haojun AI, Miki HASEYAMA
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract
This paper presents an implementation of a low-complexity speaker identification algorithm that works in the compressed audio domain. The goal is to extract speaker-dependent features and perform speaker modeling and identification without decoding the AAC bitstream, thus saving significant system resources. Silence detection and the MFCC parameters are computed from the MDCT coefficients rather than from the FFT spectrum. Each speaker is modeled by a GMM, which is trained with the EM algorithm to refine the weight and the parameters of each component. The recognition accuracy of our algorithm reaches 97% on the ARCTIC database, with a CPU load of 16% compared with algorithms based on analysis of the decoded PCM signals.
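As a rough illustration of the feature path described in the abstract, the Python sketch below derives a silence flag and MFCC-like coefficients from a single frame of MDCT coefficients. It is not the authors' implementation. The 16 kHz sample rate, the 20 mel filters, the 12 cepstral coefficients, the silence threshold, and every function name are assumptions chosen for illustration, and the squared MDCT coefficients are used here as a stand-in for the FFT power spectrum.

```python
import math


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def frame_energy(mdct):
    """Mean squared MDCT coefficient, used as a frame-energy proxy."""
    return sum(c * c for c in mdct) / len(mdct)


def is_silence(mdct, threshold=1e-4):
    # The threshold is hypothetical; a real value depends on how the
    # decoder scales the dequantized MDCT coefficients.
    return frame_energy(mdct) < threshold


def mel_filterbank(n_bins, sample_rate, n_filters):
    """Triangular mel filters over MDCT bins.

    MDCT bin k of an n_bins-coefficient frame is taken to be centred at
    (k + 0.5) * sample_rate / (2 * n_bins) Hz.
    """
    top = hz_to_mel(sample_rate / 2.0)
    edges = [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]
    centres = [(k + 0.5) * sample_rate / (2.0 * n_bins) for k in range(n_bins)]
    bank = []
    for j in range(1, n_filters + 1):
        lo, mid, hi = edges[j - 1], edges[j], edges[j + 1]
        row = []
        for f in centres:
            if lo < f <= mid:
                row.append((f - lo) / (mid - lo))   # rising slope
            elif mid < f < hi:
                row.append((hi - f) / (hi - mid))   # falling slope
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def mdct_mfcc(mdct, sample_rate=16000, n_filters=20, n_ceps=12):
    """MFCC-like features from one MDCT frame (coefficient 0 is skipped)."""
    bank = mel_filterbank(len(mdct), sample_rate, n_filters)
    power = [c * c for c in mdct]  # pseudo power spectrum
    log_e = [math.log(sum(w * p for w, p in zip(row, power)) + 1e-10)
             for row in bank]
    # DCT-II of the log mel energies, orders 1..n_ceps.
    return [
        sum(e * math.cos(math.pi * n * (j + 0.5) / n_filters)
            for j, e in enumerate(log_e))
        for n in range(1, n_ceps + 1)
    ]
```

In a pipeline of the kind the abstract describes, frames flagged by `is_silence` would be dropped and the remaining feature vectors would be used to train one GMM per speaker with EM, then scored against each model at identification time.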
© 2008 The Institute of Image Information and Television Engineers