IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<Sofuware and Information Processing>
Retrieval of Relevant Parts of Document Images Based on 2D Density Distributions of Characters
Koichi KiseMasaaki TsujinoKeinosuke Matsumoto
Author information
JOURNAL FREE ACCESS

2005 Volume 125 Issue 1 Pages 113-119

Details
Abstract
This paper presents a new method of document image retrieval that is capable of spotting parts of document images relevant to users' queries. This enables us to improve effectiveness and usability of retrieval, since users are relieved from burdens of finding relevant parts in retrieved documents. The proposed method is based on the assumption that parts of document images which densely contain characters in queries are relevant to them. For the purpose of ranking relevant parts, two-dimensional density distributions of characters are calculated based on layout features such as locations of characters and distance to the nearest characters. Based on the experimental results of retrieving Japanese newspaper articles, it is shown that the proposed method is superior to a method without a function of retrieving the parts.
Content from these authors
© 2005 by the Institute of Electrical Engineers of Japan
Previous article Next article
feedback
Top