JSAI Technical Report, Type 2 SIG
Online ISSN : 2436-5556
Pseudo Labled Latent Dirichlet Allocation
Satoko SUZUKIIchiro KOBAYASHI
Author information
RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

2013 Volume 2013 Issue AM-03 Pages 04-

Details
Abstract

In recent years, topic models have been widely used for many applications such as document summarization, document clustering etc. Labeled latent Dirichlet allocation (LLDA) was proposed based on latent Dirichlet allocation (LDA), and it regards the tags, i.e., labels, put on documents by humans as the ones expressing the contents of the documents, and uses them as supervised information to estimate latent topics of the documents. Moreover, it is reported that LLDA exceeds the ability of LDA in terms of topic estimation. However, normal documents usually do not have such tags with them, so, the use of LLDA is considerably limited.In this study, therefore, we make pseudo labels from the documents to be estimated their latent topics instead of tags put on documents by humans, and aim to make LLDA available for all documents.

Content from these authors
© 2013 Authors
Previous article Next article
feedback
Top