ヒューマンインタフェース学会論文誌
Online ISSN : 2186-8271
Print ISSN : 1344-7262
ISSN-L : 1344-7262
特集論文「若手研究者6」
Advancing Human-Computer Interaction: End-to-End Sign Language Translation.
Sihan TanKatsutoshi ItoyamaKazuhiro Nakadai
著者情報
ジャーナル フリー HTML

2024 年 26 巻 4 号 p. 391-398

詳細
抄録

Despite recent successes in nonverbal human-computer interaction (HCI) facilitated by deep learning methods, sign language translation for HCI remains underexplored. In this paper, we analyze and develop a sign language translation system that can recognize continuous signs and convert the sign meanings into natural spoken sentences in an end-to-end manner. We believe this system will enhance the interaction between computers and “deaf and hard-of-hearing individuals”. In developing this sign language translation system, we introduced high-quality sign embedding to extract informative spatial-temporal representation from continuous sign motions and adopted label smoothing to the training criteria to mitigate the overfitting issue. The proposed methods, therefore, help narrow the modality gap between vision (sign language) and language (spoken sentence). We conducted the experiments with the proposed methods on the PHOENIX14T dataset, yielding significantly improved results (WER↓: 29.20→20.79, BLEU-4↑: 21.12→ 24.56).

著者関連情報
© Non-Profit Organization, Human Interface Society
前の記事 次の記事
feedback
Top