Organizer: The Japan Society of Mechanical Engineers (general incorporated association)
Conference: 2024 Annual Meeting
Dates: 2024/09/08 - 2024/09/11
Human Machine Interfaces (HMIs) have advanced significantly in recent years, particularly as communication aids for patients with amyotrophic lateral sclerosis (ALS) or quadriplegia caused by nerve damage. Among these, HMIs driven by tongue movements have been proposed as a way to exploit body parts that often remain relatively intact. In particular, the weak electromyogram (EMG) signals produced during tongue motion make an effective input interface: compared with electroencephalogram (brain wave) signals, they offer a higher signal-to-noise ratio, which permits stable measurement and reduces the burden on the user. Estimating consonants from EMG signals, however, remains challenging. This study aims to develop a speech recognition system for non-vocal communication that uses EMG signals measured around the hyoid bone together with deep learning techniques. The proposed system combines a Convolutional Neural Network (CNN) for vowel estimation with a Long Short-Term Memory (LSTM) network for word prediction. The paper presents an overview of the system, describes the CNN and LSTM architectures, compares label estimation accuracy across different CNN parameter settings, and describes sequence prediction with the LSTM model. The experimental results show that combining the vowel estimation CNN with the word estimation LSTM yields a model with high generalization performance, enabling efficient and accurate non-vocal communication for patients. This HMI software offers a promising means of enhancing patients' communication capabilities.
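To make the two-stage architecture concrete, the sketch below shows one plausible way to wire a vowel estimation CNN into a word prediction LSTM. It is a minimal illustration, not the authors' implementation: the number of EMG channels, window length, vowel count, vocabulary size, and all layer dimensions are assumptions chosen for readability.

```python
# Minimal sketch (PyTorch) of the two-stage pipeline described in the abstract:
# a 1-D CNN classifies short EMG windows into vowels, and an LSTM maps the
# resulting vowel-probability sequence to a word label.
# All dimensions (8 EMG channels, 200-sample windows, 5 vowels, 100 words)
# are illustrative assumptions, not values from the paper.
import torch
import torch.nn as nn


class VowelCNN(nn.Module):
    """1-D CNN over a multichannel EMG window -> vowel logits."""
    def __init__(self, n_channels: int = 8, n_vowels: int = 5):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(n_channels, 32, kernel_size=7, padding=3),
            nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(32, 64, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.AdaptiveAvgPool1d(1),   # global average pooling over time
        )
        self.classifier = nn.Linear(64, n_vowels)

    def forward(self, x):              # x: (batch, n_channels, window_len)
        h = self.features(x).squeeze(-1)
        return self.classifier(h)      # vowel logits per window


class WordLSTM(nn.Module):
    """LSTM over a sequence of vowel probabilities -> word logits."""
    def __init__(self, n_vowels: int = 5, hidden: int = 64, n_words: int = 100):
        super().__init__()
        self.lstm = nn.LSTM(n_vowels, hidden, batch_first=True)
        self.out = nn.Linear(hidden, n_words)

    def forward(self, seq):            # seq: (batch, seq_len, n_vowels)
        _, (h_n, _) = self.lstm(seq)
        return self.out(h_n[-1])       # word logits from the last hidden state


# Example forward pass through the combined pipeline.
cnn, lstm = VowelCNN(), WordLSTM()
emg_windows = torch.randn(6, 8, 200)          # 6 EMG windows from one utterance
vowel_probs = torch.softmax(cnn(emg_windows), dim=-1)
word_logits = lstm(vowel_probs.unsqueeze(0))  # treat the 6 windows as one sequence
predicted_word = word_logits.argmax(dim=-1)
```

In this reading, the CNN's per-window vowel probabilities form the input sequence for the LSTM, so the word model never sees raw EMG; whether the actual system passes probabilities, hard labels, or intermediate CNN features to the LSTM is not specified in the abstract.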