Acoustical Science and Technology
Online ISSN : 1347-5177
Print ISSN : 1346-3969
ISSN-L : 0369-4232
TECHNICAL REPORT
Extraction of speech organ contours from ultrasound and real-time MRI data using DeepLabCut
Jing SunTatsuya KitamuraRyoko Hayashi
著者情報
ジャーナル オープンアクセス

2025 年 46 巻 4 号 p. 338-344

詳細
抄録

The analysis of articulatory movements, particularly tongue movements, is a key challenge in speech research. However, traditional methods for extracting tongue contours often face difficulties due to poor image quality and noise interference, complicating tongue motion analysis. This study proposes a novel approach to automatically extract the contours of the tongue and other articulatory organs from ultrasound and real-time magnetic resonance imaging (rtMRI) data. We employed DeepLabCut (DLC), a deep-learning-based tool. Our experiments demonstrated that DLC is not reliant on image edges or contrast, demonstrating robustness against noise and enabling effective automatic contour extraction. This paper highlights the method used and evaluates the accuracy of contour extraction for the tongue and other articulatory organs. By leveraging advanced deep-learning techniques, we aim to enhance the understanding of articulatory movements and improve speech analysis tools, ultimately contributing to enhanced outcomes in speech therapy and pronunciation training.

著者関連情報
© 2025 by The Acoustical Society of Japan

This article is licensed under a Creative Commons [Attribution-NoDerivatives 4.0 International] license.
https://creativecommons.org/licenses/by-nd/4.0/
前の記事 次の記事
feedback
Top