Host: The Japanese Society for Artificial Intelligence
Name : The 37th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 37
Location : [in Japanese]
Date : June 06, 2023 - June 09, 2023
Since dialogue systems are required to keep its speech style consistency, evaluating the similarity of speech styles is an important task. However, the Japanese language has a wide variety of speech styles, and the vocabulary and word usage characteristics of each speech style is vast, making it difficult to evaluate the speech style. Therefore, we propose a speech style embedding model that generates a style-sensitive vector. The speech style embedding model is constructed by fine-tuning a pre-trained BERT model using contrastive learning. Sentence pairs with similar and different speech styles, which are necessary for contrastive learning, are automatically collected on a large scale using a sequence of sentences in web novels. We also analyze the grouping of speech styles and the characteristic vocabulary and word usage of each speech style using Ward hierarchical clustering method. Finally, we focus on the variation in the speech style of the same person depending on the situation, and analyze the variation in the style-sensitive vectors of the same character in the novel.