Abstract
We developed a system that detects the speech intervals of multiple speakers and estimates their face orientations during those intervals by combining sound-direction information from multiple microphone arrays with human position information. The system was evaluated under three conditions: individual utterances at different positions and orientations, simultaneous dialogues by multiple speakers, and moving sound sources. The results show that the proposed system detects speech intervals with more than 90% accuracy and estimates face orientations with mean absolute errors of around 20 degrees, except in cases where every array lies opposite to the speaker's face orientation.