JSAI Technical Report, SIG-SLUD
Online ISSN : 2436-4576
Print ISSN : 0918-5682
98th (Sep.2023)
Conference information

Laughter Detection using Pre-trained Automatic Speech Recognition Model
Takashi USHIOYosuke HIGUCHITaku KUHARAHaruo FUJIWARAHiroshi KATO
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Pages 59-65

Details
Abstract

In recent years, remote meeting has become widely used, and solutions for analyzing spoken dialogue are becoming more widespread.For example, in the context of business negotiations, a key focus lies in developing solutions to assess conversational skills through the extraction and summarization of linguistic features. Additionally, paralinguistic information holds significant value in analyzing spoken dialogue as it offers insights into a speaker's impressions and emotions,manifesting in various conversational cues such as laughter, filler words, and gasps. In this work, for the purpose of detecting laughter, we propose a detection system based on a pre-trained speech recognition model and a semi-supervised learning method using weakly labeled data in which the laughter interval is unclear. We conduct experiments on a corpus of Japanese conversation and show that the proposed method outperforms conventional methods. Moreover, we discuss the diversity of speakers, cultures, and sound collection environments as factors affecting the laughter detection performance.

Content from these authors
© 2023 The Japaense Society for Artificial Intelligence
Previous article Next article
feedback
Top