9-1 メタデータ生成のための音声認識の改善(第9部門 メディア認識と評価I)

佐藤 庄衛; 小林 彰夫; 尾上 和穂; 山田 一郎; 佐野 雅規; 今井 亨

doi:10.11485/iteac.2005.0__9-1-1_

2005

Session ID : 9-1

DOI https://doi.org/10.11485/iteac.2005.0__9-1-1_

Conference information

Name : 2005 ITE Annual Convention

Location : [in Japanese]

Date : August 24, 2005 - August 26, 2005

9-1 Improvement of speech recognition for metadata generation

Shoei Sato, Akio Kobayashi, Kazuo Onoe, Ichiro Yamada, Masanori Sano, Toru Imai

Author information

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

A future broadcasting system will broadcast along with regular programming a wide variety of program-related information called metadata. Our speech recognition technology is applied to extract information that corresponds to highlights from the commentary of a football game. The proposed method uses additional phoneme models of excitedly uttered words which may be important key words for highlight extraction. The proposed method obtained 14% of key word error reduction, which improved extraction accuracy of scenes making a shoot by 18% in precision and 17% in recall.

Corresponding author

Register with J-STAGE for free!