Transactions of the Society of Instrument and Control Engineers
Online ISSN : 1883-8189
Print ISSN : 0453-4654
ISSN-L : 0453-4654
Paper
Intelligent Human Tracking Based on Multimodal Integration
Keisuke NAKAMURAKazuhiro NAKADAIFutoshi ASANOHirofumi NAKAJIMAGökhan INCE
Author information
JOURNAL FREE ACCESS

2012 Volume 48 Issue 6 Pages 349-358

Details
Abstract

Localization and tracking of humans are essential research topics in robotics. In particular, Sound Source Localization (SSL) has been of great interest. Despite the numerous reported methods, SSL in a real environment had mainly three issues; robustness against noise with high power, no framework for selective listening to sound sources, and tracking of inactive and/or noisy sound sources. For the first issue, we extended Multiple SIgnal Classification by incorporating Generalized Eigen Value Decomposition (GEVD-MUSIC) so that it can deal with high power noise and can select target sound sources. For the second issue, we proposed Sound Source Identification (SSI) based on hierarchical Gaussian mixture models and integrated it with GEVD-MUSIC to realize a function to listen to a specific sound source according to the sort of the sound source. For the third issue, auditory and visual human tracking were integrated using particle filtering. These three techniques are integrated into an intelligent human tracking system. Experimental results showed that integration of SSL and SSI successfully achieved human tracking only by audition, and the audio-visual integration showed considerable improvement in tracking by compensating the loss of auditory or visual information.

Content from these authors
© 2012 The Society of Instrument and Control Engineers
Previous article Next article
feedback
Top