Reports of the Technical Conference of the Institute of Image Electronics Engineers of Japan
Online ISSN : 2758-9218
Print ISSN : 0285-3957
Reports of the 243rd Technical Conference of the Institute of Image Electronics Engineers of Japan
Session ID : 08-05-09

15:30-17:02 Chair: Naoki KOBAYASHI, Saitama Medical University
Visual Recognition of Spoken Words Using Optical Flow
*Ryota NAKAMURA, Shigeru AKAMATSU
CONFERENCE PROCEEDINGS RESTRICTED ACCESS

Abstract

This paper describes an automatic vision-based spoken word recognition system that uses, instead of an audio signal, a visual motion signal obtained from video of the region around the mouth during speech. Motion information for each pixel in the input image sequence was obtained by computing optical flow, and feature values representing the spatial configuration of the pixel-wise velocities were extracted for each frame. The start and end times of each spoken word were determined from the velocity feature values, and a high-dimensional feature vector was obtained to represent the time variation of the velocity distribution over the utterance period. As a preliminary performance evaluation of the proposed feature for spoken word recognition, a discrimination test on five spoken words, including A-RI-GA-TO-U and KO-N-NI-CHI-WA, was conducted, and fairly promising results were achieved.
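
The abstract does not give the details of the feature extraction, so the following is only a minimal sketch of the kind of pipeline it outlines, under assumptions that are not taken from the paper: the dense optical flow of the mouth region is assumed to be already computed for each frame, the per-frame feature is assumed to be the mean velocity in each cell of a coarse grid, the utterance period is assumed to be found by thresholding a simple motion energy, and the word-level vector is assumed to be built by resampling the utterance to a fixed number of frames. All function names and parameters (`frame_feature`, `utterance_bounds`, `word_vector`, `grid`, `threshold`, `n_samples`) are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

# flow[y][x] = (vx, vy): assumed precomputed dense optical flow for one frame
Flow = List[List[Tuple[float, float]]]


def frame_feature(flow: Flow, grid: int = 2) -> List[float]:
    """Mean (vx, vy) in each cell of a grid x grid partition of the region."""
    h, w = len(flow), len(flow[0])
    feat: List[float] = []
    for gy in range(grid):
        for gx in range(grid):
            y0, y1 = gy * h // grid, (gy + 1) * h // grid
            x0, x1 = gx * w // grid, (gx + 1) * w // grid
            cell = [flow[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            n = max(len(cell), 1)  # guard against empty cells
            feat.append(sum(v[0] for v in cell) / n)
            feat.append(sum(v[1] for v in cell) / n)
    return feat


def motion_energy(feat: Sequence[float]) -> float:
    """Sum of absolute velocity components (an assumed activity measure)."""
    return sum(abs(v) for v in feat)


def utterance_bounds(
    features: Sequence[Sequence[float]], threshold: float
) -> Optional[Tuple[int, int]]:
    """First and last frame whose motion energy exceeds the threshold."""
    active = [i for i, f in enumerate(features) if motion_energy(f) > threshold]
    if not active:
        return None
    return active[0], active[-1]


def word_vector(
    features: Sequence[Sequence[float]], start: int, end: int, n_samples: int = 4
) -> List[float]:
    """Pick n_samples evenly spaced frames in [start, end] and concatenate them."""
    span = end - start
    vec: List[float] = []
    for k in range(n_samples):
        idx = start + round(k * span / (n_samples - 1)) if n_samples > 1 else start
        vec.extend(features[idx])
    return vec
```

With this sketch, every utterance maps to a vector of length `n_samples * 2 * grid * grid` regardless of its duration, so words of different lengths could be compared by any fixed-dimension classifier; the abstract does not say which classifier was used.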

© 2009 by The Institute of Image Electronics Engineers of Japan