The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec)
Online ISSN : 2424-3124
2019
Session ID : 2A2-D07
Conference information

Monaural Speech Enhancement for a Hose-Shaped Rescue Robot with Air-Jet Noise
*Yoshiaki bandoYuichi AmbeKatsutoshi ItoyamaMasashi KonyoSatoshi TadokoroKazuhiro NakadaiHiroshi G. Okuno
Author information
CONFERENCE PROCEEDINGS RESTRICTED ACCESS

Details
Abstract

This paper presents a monaural speech enhancement method for a hose-shaped rescue robot based on a deep speech prior. Speech enhancement is crucial to make a robot operator succeed in detecting human voices because audio signals captured by a microphone on the robot are contaminated by ego-noise. We have been developed three enhancement methods: 1) a blind speech enhancement called robust nonnegative matrix factorization (RNMF), 2) an extension of RNMF with a pre-trained noise model, and 3) another extension of RNMF with a deep speech prior, i.e., a pre-trained speech model based on deep learning. In this paper, we develop a new extension of RNMF by combining the pre-trained noise and speech models as a unified model and evaluated these methods on a hose-shaped rescue robot whose ego-noise consists of vibration-motor and air-jet noise. Experimental results show that the new method outperforms the three RNMF methods when the signal-to-noise ratio is equal to or less than +5 dB.

Content from these authors
© 2019 The Japan Society of Mechanical Engineers
Previous article Next article
feedback
Top