Acoustical Science and Technology
Online ISSN : 1347-5177
Print ISSN : 1346-3969
ISSN-L : 0369-4232

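A final published version of this article is available. Please refer to the final published version.
When citing, please also cite the final published version.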

Extraction of Speech Organ Contours from Ultrasound and real-time MRI Data using DeepLabCut
Jing Sun, Tatsuya Kitamura, Ryoko Hayashi
Journal, Open Access, Advance online publication

Article ID: e24.128

Abstract

The analysis of articulatory movements, particularly tongue movements, is a key challenge in speech research. Traditional methods for extracting tongue contours often struggle with poor image quality and noise, which complicates the analysis of tongue motion. This study proposes a novel approach to automatically extract the contours of the tongue and other articulatory organs from ultrasound and real-time magnetic resonance imaging (rtMRI) data using DeepLabCut (DLC), a deep-learning-based tool. Our experiments showed that DLC does not rely on image edges or contrast, which makes it robust against noise and enables effective automatic contour extraction. This paper describes the method and evaluates the accuracy of contour extraction for the tongue and other articulatory organs. By leveraging deep-learning techniques, we aim to advance the understanding of articulatory movements and improve speech analysis tools, ultimately contributing to better outcomes in speech therapy and pronunciation training.
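
As an illustration of what evaluating contour-extraction accuracy can look like, the following minimal sketch computes the mean Euclidean distance, in pixels, between automatically tracked contour points and manually annotated reference points for one frame. The abstract does not state which accuracy metric the authors used, so the metric, the function name mean_point_error, and the coordinate values are all assumptions made for illustration only; nothing here is taken from the paper or from the DeepLabCut API.

```python
import math


def mean_point_error(predicted, reference):
    """Mean Euclidean distance between paired (x, y) points.

    Hypothetical helper for illustration; not part of DeepLabCut.
    """
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("need two non-empty point lists of equal length")
    total = sum(math.dist(p, r) for p, r in zip(predicted, reference))
    return total / len(predicted)


# Made-up tongue-contour keypoints for a single frame (pixel coordinates).
manual = [(10.0, 50.0), (20.0, 40.0), (30.0, 35.0), (40.0, 40.0)]
tracked = [(10.0, 53.0), (24.0, 43.0), (30.0, 35.0), (40.0, 39.0)]

error_px = mean_point_error(tracked, manual)
print(f"mean point error: {error_px:.2f} px")
```

A per-frame score of this kind could be averaged over frames or reported separately for each articulator, depending on how the reference annotations are organized.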

© 2025 by The Acoustical Society of Japan

This article is licensed under a Creative Commons [Attribution-NoDerivatives 4.0 International] license.
https://creativecommons.org/licenses/by-nd/4.0/