IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<Information Processing, Software>
A Method for High Throughput Deduplication for Primary File Servers by Using Pre-fetch Cache
Hitoshi KameiTakaki Nakamura
Author information
JOURNAL FREE ACCESS

2015 Volume 135 Issue 6 Pages 619-628

Details
Abstract
We propose a method of high throughput file level deduplication for primary file servers, called partial data background pre-fetch (PDBP). To achieve high throughput of deduplication, the method reduces the number of disk I/Os issued during deduplication process. Before running deduplication process, the proposed method pre-fetches a part of data of shared files referred by deduplicated files. After that, the method processes the files that are larger than a file size threshold defined by administrators.  In this paper, we evaluate a deduplication processing time by using a simulation model of PDBP. Consequently, we confirm that the processing time of PDBP is reduced by about 50 % compared to a conventional file deduplication method when the threshold is set to 4 KB.
Content from these authors
© 2015 by the Institute of Electrical Engineers of Japan
Previous article Next article
feedback
Top