Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 4Xin1-78

Edge Devices-Friendly Dynamic Sign Language Recognition System using Attention Module
*Yuejie MENG, Masao YANAGISAWA, Youhua SHI
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

In recent years, voice assistants such as Siri, built on artificial intelligence (AI) techniques, have made everyday life increasingly convenient. However, hearing-impaired people, especially those who cannot speak, are unable to benefit from this technology for physical reasons. Gesture recognition based on deep learning is a promising alternative for them. Many previous studies, however, used 3D-CNN or CNN+LSTM models to recognize gestures from images or videos, which requires a large amount of memory. To address this problem, this paper proposes DGT-STA, a Transformer-based gesture recognition model. The model achieves higher accuracy than 3D-CNN with a shallower neural network, and reduces memory usage to 50.91% compared with models using other attention modules. In addition, a Japanese Sign Language dataset is built to train and evaluate DGT-STA. Finally, this paper verifies that deploying DGT-STA on IoT edge devices is feasible.

© 2023 The Japanese Society for Artificial Intelligence