Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 1O3-GS-7-01

Automatic Production of Audio Descriptions Using Image Recognition for Live Baseball Broadcasts
*Yuki SHIMANO, Yuya KUWANO, Masaki TAKAHASHI, Masaru MIYAZAKI, Masanori SANO, Atsushi IMAI, Toru TAKAGI
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

Audio descriptions enable visually impaired audiences to enjoy broadcast programs by providing supplementary information, such as a person's actions and facial expressions, that is difficult to grasp from the main audio content alone. Although such descriptions would be ideal for live sports broadcasts, producing them for such events entails high costs and requires expert commentary skills. We therefore developed a system that creates audio descriptions of live baseball broadcasts and distributes them to users' smartphones in real time. The audio descriptions are created automatically from the superimposed captions of baseball broadcasts by using image recognition. The experimental results indicate that the proposed method recognizes the information in superimposed captions and robustly produces audio descriptions in real time.

© 2023 The Japanese Society for Artificial Intelligence