年次大会
Online ISSN : 2424-2667
ISSN-L : 2424-2667
セッションID: S121-02
会議情報

複数人の音声データから各話者の感情を推定する手法
*岡崎 貫治綿貫 啓一
著者情報
会議録・要旨集 認証あり

詳細
抄録

Currently, many technologies have been proposed to identify speakers and transcribe speech from multi-person conversation data, such as meetings, and multiple services are commercially available from various companies. These services are continuously being improved, and the accuracy of speaker identification and transcription is also improving. Furthermore, some services attempt to estimate the emotions of individual speakers. However, such emotion estimation is limited to scenarios involving one-on-one audio data, such as call centers, and does not extend to estimating emotions from multi-person conversation data. Understanding whether a conversation was heated or not from multi-person conversation data remains limited to textual information, making it difficult to accurately infer the emotional context. Therefore, this study attempts to identify speakers from multi-person conversation data, separate or capture the audio data, and estimate the emotions of the identified speakers. As a result, it was possible to identify sections where any given speaker spoke alone and to estimate the emotions in those identified sections.

著者関連情報
© 2024 一般社団法人 日本機械学会
前の記事 次の記事
feedback
Top