人工知能学会第二種研究会資料
Online ISSN : 2436-5556
単語の共起と出現頻度に着目した文書の索引付け
奥井 颯平猪口 明博
著者情報
研究報告書・技術報告書 フリー

2015 年 2015 巻 DOCMAS-009 号 p. 02-

詳細
抄録

In this paper, we propose two models to weight each term in the document for document retrieval. Our idea of the models come from traditional Term Frequencies (TFs) and Term Weights (TWs) proposed in 2013. TF is based on the number of term occurrences in a document and used as de facto standard. On the other hand, TW is based on variation of term co-occurrences in a document and outperforms to TF. Our proposed models give much weight to terms which cooccur with terms frequently occur. We show experimental results comparing to the conventional models using a very large text corpus.

著者関連情報
© 2015 著作者
次の記事
feedback
Top