Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Paper
Language Model Estimation from a Stochastically Tagged Corpus
Shinsuke MoriTetsuro SasadaGraham Neubig
Author information
JOURNAL FREE ACCESS

2011 Volume 18 Issue 2 Pages 71-87

Details
Abstract
In this paper, first we propose a language model based on pairs of word and input sequence. Then we propose the notion of a stochastically tagged corpus to cope with tag estimation errors. The experimental results we conducted using kana-kanji converters showed that our ideas, the language model based on pairs of word and input sequence and the notion of a stochastically tagged corpus, both improved the accuracy. Therefore we conclude that the language model based on pairs and the notion of a stochastically tagged corpus are effective in language modeling for the kana-kanji conversion task.
Content from these authors
© 2011 The Association for Natural Language Processing
Previous article Next article
feedback
Top