残差駆動型アーキテクチャの提案と音響ストリーム分離への応用

中谷 智広; 後藤 真孝; 川端 豪; 奥乃 博

doi:10.11517/jjsai.12.1_111

抄録

This paper presents the Residue-Driven Architecture (RDA) as a general computational frame-work for sound stream segregation based on a multi-agent paradigm. Sound stream segregation is an important primary processing for computationally understanding sounds (Computational Auditory Scene Analysis) in the real-world. Since RDA is designed without assuming any specific sound attributes, it can be applied to various kinds of sound stream segregation problems. The RDA consists of three kinds of agents : an event-detector, a tracer-generator, and tracers. The event-detector calculates a residue by subtracting the predicted input from the actual input. When a residue exceeds a threshold value, tracer-generator generates a tracer that extracts a sound stream from the residue and returns a predicted input of the next time frame to the event-detector. The RDA is applied to the design of two subsystems : A monaural subsystem segregates sound streams under background noise using harmonic structure ; a binaural subsystem refines the sound streams segregated by the monaural system using the direction of the sound source. These subsystems can be concisely designed and simply implemented based on the RDA ; therefore, the effectiveness of the RDA is proven. In addition, experimental results show that the capability of the sound stream segregation system is improved by combining these subsystems.

著者関連情報

お気に入り & アラート

閲覧履歴

発行機関からのお知らせ

PDF閲覧時に認証を求められる記事がございます（発行後2年間）が，人工知能学会の個人会員は無料で閲覧可能です．認証のための購読者番号やパスワードは会員マイページ（ユース会員の場合はジュニア・ユース会員サイト）にログインし「お知らせ」にてご確認下さい（会員情報管理システムとオンラインで連携していないため，パスワードは同システムとは異なります．また，認証情報の更新は偶数月の月末に実施しております．新規入会された方は利用できるまでしばらくお待ちください）．個人会員以外は記事複製申込フォームから購入いただけます．また，アマゾンにて冊子版あるいはKindle版を購入いただくことも可能です．

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）