Journal of the Acoustical Society of Japan (E)
Online ISSN : 2185-3509
Print ISSN : 0388-2861
ISSN-L : 0388-2861
Speech recognition based on the subspace method
AI class-description learning viewpoint
Yoichi Takebayashi
著者情報
ジャーナル フリー

1992 年 13 巻 6 号 p. 429-439

詳細
抄録
This paper describes the learning mechanism employed in a highly efficient user-adaptive speech recognizer based on the subspace method for large vocabulary Japanese test input. Comparing the subspace-based learning system with the well-known AI learning system ARCH, the following points are made:(1) Subspace learning using covariance matrix modification and KL-expansion is a kind of class-description learning from examples, as found in ARCH. The subspace learning method focusses on feature extraction, which results in a powerful representation of pattern characteristics for each pattern class, but does not involve only pattern classification, unlike conventional pattern recognition methods.(2) The concepts of “Near-Miss, ” “Require-Link” and “Forbid-Link” in ARCH can be simulated with the subspace method. Since the subspace method deals with patterns but not symbols, it does not need pattern-symbol conversion. In other words, the subspace learning method has a more versatile description capability than ARCH.(3) Minsky's concept of “Uniframe” is implemented in a speech recognizer based on the subspace method. The “Uniframe” obtained with KL-expansion is equivalent to a subspace which represents a meaning of a class. Minsky's “Accumula-tion” and “Exceptional Principal” concepts have also been taken into account.
著者関連情報
© The Acoustical Society of Japan
前の記事
feedback
Top