JSAI Technical Report, SIG-SLUD
Online ISSN : 2436-4576
Print ISSN : 0918-5682
99th (Dec.2023)
Conference information

A Non-contact Respiratory Waveform Estimation Method for Improving Compatibility of Dialogue Systems
Takao OBIFunakoshi KOTARO
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Pages 07-12

Details
Abstract

Respiration is closely related to speech, so respiratory information is useful for improving multimodal spoken dialogue systems from various perspectives. A machine-learning task is presented for multimodal spoken dialogue systems to improve the compatibility of the systems and promote smooth interaction with them. This task consists of two subtasks: waveform amplitude estimation and waveform gradient estimation. A dataset consisting of respiratory data for 30 participants was created for this task, and a strong baseline method based on 3DCNN-ConvLSTM was evaluated on the dataset. Finally, our task was shown to be effective in predicting user voice activity after 200 ms. These results suggest that our task is effective for improving multimodal spoken dialogue systems.

Content from these authors
© 2023 The Japaense Society for Artificial Intelligence
Previous article Next article
feedback
Top