音声認識用言語モデルのためのタスク適応化と定型表現の利用

中川 聖一; 赤松 裕隆; 西崎 博光

doi:10.5715/jnlp.6.2_97

A task adaptation method and use of idiomatic expression of stochastic language model for speech recognition

SEIICHI NAKAGAWA, HIROTAKA AKAMATSU, HIROMITSU NISHIZAKI

Author information

Keywords: peech Recognition, Language Model, N-gram, Task Adaptation, Idiomatic Expression

JOURNAL FREE ACCESS

1999 Volume 6 Issue 2 Pages 97-115

DOI https://doi.org/10.5715/jnlp.6.2_97

Details

Abstract

In this paper, we describe a method that constructs language models using a taskadaptation strategy and idiomatic expressions of news articles. To build an effective N-gram based language model, it should be noted that the training data must be prepared as much as possible. However, for a given task/topic, it is very difficult to gather much data. First, we investigated the effect of a task adaptation method of N-gram language model using a limited amount of target articles. Second, we investigated the effect of the language model adaptation method using the latest articles. Third, we investigated the effect of the use of idiomatic expressions as morpheme units, since some specific expressions and idiomatic expressions are frequently observed in news articles. We show our proposed three methods are effective for constructing N-gram language models.

Corresponding author

Register with J-STAGE for free!