電気学会論文誌C(電子・情報・システム部門誌)
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<音声画像処理・認識>
2つのスペクトログラムを用いた画像処理による混合音声の分離に関する研究
樋口 寛晃旭 健作佐川 雄二杉江 昇
著者情報
ジャーナル フリー

2004 年 124 巻 12 号 p. 2439-2445

詳細
抄録
We propose a method for separating speeches using two spectrograms. First, two spectrograms are generated from voices recorded with a pair of microphones. The onsets and the offsets of the frequency components are extracted as the features using image processing techniques. Then the correspondences of the features between the spectrograms are determined and the intermicrophone time differences are calculated. Each of frequency components with the common onset/offset occurrences and time difference are grouped together as originating one of the speech signals. A set of band-pass filters are generated corresponding to each group of frequency components. Finally, each of the separated speech signals is extracted by applying the set of band-pass filters to the voice signal recorded by a microphone. Experiments were conducted with the mixture of a male speech sound and a female speech sound consisting of Japanese vowel and contain consonant. The evaluation results demonstrated that the separation was done reasonably well with the proposed method.
著者関連情報
© 電気学会 2004
前の記事 次の記事
feedback
Top