人工知能
Online ISSN : 2435-8614
Print ISSN : 2188-2266
人工知能学会誌(1986~2013, Print ISSN:0912-8085)
残差駆動型アーキテクチャの提案と音響ストリーム分離への応用
中谷 智広後藤 真孝川端 豪奥乃 博
著者情報
解説誌・一般情報誌 フリー

1997 年 12 巻 1 号 p. 111-120

詳細
抄録

This paper presents the Residue-Driven Architecture (RDA) as a general computational frame-work for sound stream segregation based on a multi-agent paradigm. Sound stream segregation is an important primary processing for computationally understanding sounds (Computational Auditory Scene Analysis) in the real-world. Since RDA is designed without assuming any specific sound attributes, it can be applied to various kinds of sound stream segregation problems. The RDA consists of three kinds of agents : an event-detector, a tracer-generator, and tracers. The event-detector calculates a residue by subtracting the predicted input from the actual input. When a residue exceeds a threshold value, tracer-generator generates a tracer that extracts a sound stream from the residue and returns a predicted input of the next time frame to the event-detector. The RDA is applied to the design of two subsystems : A monaural subsystem segregates sound streams under background noise using harmonic structure ; a binaural subsystem refines the sound streams segregated by the monaural system using the direction of the sound source. These subsystems can be concisely designed and simply implemented based on the RDA ; therefore, the effectiveness of the RDA is proven. In addition, experimental results show that the capability of the sound stream segregation system is improved by combining these subsystems.

著者関連情報
© 1997 人工知能学会
前の記事 次の記事
feedback
Top