Journal of Information Processing
Online ISSN : 1882-6652
ISSN-L : 1882-6652
Compressing Inverted Index Using Optimal FastPFOR
Veluchamy Glory, Sandanam Domnic
Journal, free access

2015 Volume 23 Issue 2 Pages 185-191

Abstract

Indexing plays an important role in storing and retrieving data in an Information Retrieval System (IRS). The inverted index is the most frequently used indexing structure in IRS. Compression schemes are used to reduce the size of the index and to retrieve data efficiently, because compressed data can be retrieved faster than uncompressed data. High-speed compression schemes can therefore improve the performance of an IRS. In this paper, we study and analyze various compression techniques for 32-bit integer sequences. Previously proposed compression schemes achieve either a better compression rate or faster decoding, so their overall decompression speed (disk access + decoding) is not necessarily better. We propose a new compression technique, called Optimal FastPFOR, based on FastPFOR. The proposed method uses a better integer representation and storage structure for compressing the inverted index in order to improve decompression performance. We used the TREC data collection in our experiments, and the results show that the proposed code achieves better compression and decompression than FastPFOR and other existing related compression techniques.
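To make the setting concrete, the following is a minimal sketch of the general idea behind patched frame-of-reference (PFOR-style) coding of a posting list. It is not the Optimal FastPFOR scheme proposed in the paper, whose integer representation and storage layout are not described in this abstract. The function names (`to_gaps`, `from_gaps`, `pfor_encode`, `pfor_decode`) and the fixed bit width `b` are assumptions made for illustration only.

```python
from typing import List, Tuple


def to_gaps(doc_ids: List[int]) -> List[int]:
    """Turn a strictly increasing posting list into d-gaps (differences)."""
    gaps, prev = [], 0
    for doc_id in doc_ids:
        gaps.append(doc_id - prev)
        prev = doc_id
    return gaps


def from_gaps(gaps: List[int]) -> List[int]:
    """Rebuild the posting list from its d-gaps by prefix summation."""
    doc_ids, total = [], 0
    for gap in gaps:
        total += gap
        doc_ids.append(total)
    return doc_ids


def pfor_encode(values: List[int], b: int) -> Tuple[int, List[Tuple[int, int]]]:
    """Pack the low b bits of every value into one integer.

    Values that do not fit in b bits become exceptions, stored
    separately as (position, remaining high bits).
    """
    mask = (1 << b) - 1
    packed = 0
    exceptions = []
    for i, v in enumerate(values):
        packed |= (v & mask) << (i * b)
        if v > mask:
            exceptions.append((i, v >> b))
    return packed, exceptions


def pfor_decode(packed: int, exceptions: List[Tuple[int, int]],
                n: int, b: int) -> List[int]:
    """Unpack n values of b bits each, then patch in the exceptions."""
    mask = (1 << b) - 1
    values = [(packed >> (i * b)) & mask for i in range(n)]
    for i, high in exceptions:
        values[i] |= high << b
    return values


if __name__ == "__main__":
    postings = [3, 7, 8, 15, 1000, 1002]
    gaps = to_gaps(postings)
    packed, exceptions = pfor_encode(gaps, b=3)
    restored = from_gaps(pfor_decode(packed, exceptions, len(gaps), b=3))
    print(gaps, exceptions, restored == postings)
```

In this sketch most gaps fit in 3 bits and only the one large gap is stored as an exception, which is the trade-off between compression rate and decoding cost that the abstract refers to. A real scheme would work on fixed-size blocks of 32-bit integers and choose `b` per block.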

© 2015 by the Information Processing Society of Japan