Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
35th (2021)
Session ID : 4I2-GS-7c-02
Conference information

Speaker-independent acoustic features extraction using StarGAN-VC and its applications for double articulation analysis
*Soichiro KOMURAKaede HAYASHIAkira TANIGUCHITadahiro TANIGUCHIHirokazu KAMEOKA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Nonparametric Bayesian double articulation analyzer (NPB-DAA) is a method to discover words and phoneme units from continuous speech signals in an unsupervised manner. However, acoustic features have speaker-dependency, and it prevent NPB-DAA from discovering words and phonem units from multi-speaker utterances. This paper proposes to use star generative adversarial network for voice conversion (StarGAN-VC) to extract speaker-independent acoustic features and optimize NPB-DAA and StarGAN-VC simultaneously by using mutual learning based on Neuro-SERKET framework. The effect of mutual learning is shown through an experiment.

Content from these authors
© 2021 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top