ITE Transactions on Media Technology and Applications
Online ISSN : 2186-7364
ISSN-L : 2186-7364
Special Section on Image and Video Analysis, Search, and Benchmark
[Paper] Visual Instance Retrieval with Deep Convolutional Networks
Ali S. RazavianJosephine SullivanStefan CarlssonAtsuto Maki
著者情報
ジャーナル フリー

2016 年 4 巻 3 号 p. 251-258

詳細
抄録
This paper provides an extensive study on the availability of image representations based on convolutional networks (ConvNets) for the task of visual instance retrieval. Besides the choice of convolutional layers, we present an efficient pipeline exploiting multi-scale schemes to extract local features, in particular, by taking geometric invariance into explicit account, i.e. positions, scales and spatial consistency. In our experiments using five standard image retrieval datasets, we demonstrate that generic ConvNet image representations can outperform other state-of-the-art methods if they are extracted appropriately.
著者関連情報
© 2016 The Institute of Image Information and Television Engineers
前の記事 次の記事
feedback
Top