IEEJ Transactions on Sensors and Micromachines
Online ISSN : 1347-5525
Print ISSN : 1341-8939
ISSN-L : 1341-8939
Intelligent Auditory Sensing using Nonlinear Loudness/Pitch/Timbre Decomposition Operators
Mototsugu Abe, Shigeru Ando
JOURNAL FREE ACCESS

1997 Volume 117 Issue 4 Pages 209-214

Abstract
To extract simple and informative measures of the time-varying nature of musical sound and speech, we construct a wavelet energy distribution and orthogonally decompose its instantaneous energy change into three primary components: 1) loudness change, 2) pitch shift, and 3) timbre change, according to the coherency in power magnification and frequency shift. The decomposition is performed in the time-frequency gradient space, and the power ratios of the three components are passed as signals to higher processing stages. Several experiments show that these signals offer superior resolution and sensitivity for segmenting the dynamic behavior of speech, musical sounds, and similar signals.
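The abstract does not give the operators themselves, so the following is only a minimal sketch of one way such a decomposition could look, not the authors' formulation. It assumes two consecutive frames of an energy distribution sampled on a log-frequency axis, models a coherent power magnification as a change proportional to the energy itself, models a coherent frequency shift as a change proportional to the frequency gradient of the energy, and treats whatever remains as timbre change. The function name `decompose_change`, the finite-difference gradients, and the Gram-Schmidt step are all illustrative assumptions.

```python
import numpy as np


def decompose_change(e_prev, e_next, eps=1e-12):
    """Split the frame-to-frame energy change into power ratios.

    Hypothetical helper (not from the paper). Inputs are two consecutive
    frames of an energy distribution on a log-frequency axis.
    Returns (loudness_ratio, pitch_ratio, timbre_ratio), which sum to 1
    unless the two frames are identical, in which case all are 0.
    """
    e_prev = np.asarray(e_prev, dtype=float)
    e_next = np.asarray(e_next, dtype=float)

    e_mid = 0.5 * (e_prev + e_next)   # energy at the temporal midpoint
    e_t = e_next - e_prev             # temporal gradient (per frame)
    e_w = np.gradient(e_mid)          # gradient along the frequency axis

    total = float(e_t @ e_t)          # total power of the change
    if total < eps:
        return 0.0, 0.0, 0.0

    # Assumed loudness direction: uniform magnification changes E along E.
    n1 = np.linalg.norm(e_mid)
    u1 = e_mid / n1 if n1 > eps else np.zeros_like(e_mid)

    # Assumed pitch direction: a uniform shift changes E along dE/dw.
    # Gram-Schmidt keeps it orthogonal to the loudness direction.
    r = e_w - (e_w @ u1) * u1
    n2 = np.linalg.norm(r)
    u2 = r / n2 if n2 > eps else np.zeros_like(r)

    loud = float(e_t @ u1) ** 2
    pitch = float(e_t @ u2) ** 2
    timbre = max(total - loud - pitch, 0.0)  # residual, incoherent change
    return loud / total, pitch / total, timbre / total


# Usage: a single spectral bump that is scaled, then shifted.
x = np.arange(256, dtype=float)


def bump(center, sigma=8.0):
    return np.exp(-0.5 * ((x - center) / sigma) ** 2)


scaled = decompose_change(bump(128.0), 1.1 * bump(128.0))
shifted = decompose_change(bump(127.75), bump(128.25))
print("scaled :", scaled)
print("shifted:", shifted)
```

In this sketch the scaled bump should register almost entirely as loudness change and the shifted bump almost entirely as pitch shift. Because the three directions are mutually orthogonal, the three powers add up to the total power of the change, which is what makes reporting them as ratios meaningful.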
© The Institute of Electrical Engineers of Japan