The Journal of the Institute of Image Electronics Engineers of Japan
Online ISSN : 1348-0316
Print ISSN : 0285-9831
ISSN-L : 0285-9831
Contributed Papers
Audio-Visual Tracking System for Multi-Modal Interface
Dmitry ZOTKINKazuhiko TAKAHASHITatsuo YOTSUKURAShigeo MORISHIMANobuji TETSUTANI
Author information
JOURNAL FREE ACCESS

2001 Volume 30 Issue 4 Pages 452-463

Details
Abstract
In this paper, a front end system which uses audio and video information to track the people or other sound sources in the ordinary room has developed. The microphone array is used for determining the spatial location of the sound; the active video camera acquires the image of the area where the sound is detected, detects the people in the image by using skin color and can zoom and track a speaker. Several add-ons to the system include various visualization tools such as on-screen displays of waveforms, correlation plots, spectrum plots, spatial acoustic energy distribution, running time-frequency acoustic energy plots, and the possibility of real-time beamforming with real-time output to the headphones. The system can be used as a front-end for the non-encumbering human-computer interaction by video and audio means.
Content from these authors
© 2001 by the Institute of Image Electronics Engineers of Japan
Previous article Next article
feedback
Top