Host: Japan Society for Fuzzy Theory and Intelligent Info rmatics (SOFT)
Name : 40th Fuzzy System Symposium
Number : 40
Location : [in Japanese]
Date : September 02, 2024 - September 04, 2024
In recent years, the importance of a speech segment detection technique called speaker di-(breakpoint)arization has increased, mainly in conferences, news, and telephone calls. However, conventional speaker segmentation detection methods using neural networks require a huge amount of training data. In this study, training data was created by recording the speech of two speakers of the same gender, splitting and combining them to create a synthetic utterance. The effect of the distance and angle to the microphone on the accuracy of the test data was examined in tests with non-synthesized speech.