ITE Transactions on Media Technology and Applications
Online ISSN : 2186-7364
ISSN-L : 2186-7364
Special Section on Image and Video Analysis, Search, and Benchmark
[Invited Paper] Object Pooling for Multimedia Event Detection and Evidence Localization
Hao Zhang, Chong-Wah Ngo
Journal, Free access

2016, Volume 4, Issue 3, pp. 218-226

Abstract

Multimedia event detection (MED) and evidence hunting are two primary topics in multimedia event search. The former retrieves a list of relevant videos given an event query, whereas the latter explains why, and to what degree, a retrieved video answers that query. Common practice treats these two topics with separate methods; in this paper, we combine MED and evidence hunting into a joint framework. We propose a refined semantic representation named object pooling, which dynamically extracts visual snippets corresponding to when and where evidence might appear. The main idea of object pooling is to adaptively sample regions from frames to generate an object histogram that can be efficiently rolled up and back. Experiments conducted on the large-scale TRECVID MED 2014 dataset demonstrate the effectiveness of the proposed object pooling approach on both event detection and evidence hunting.
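The abstract does not specify how the object histogram is pooled, so the following is only a minimal sketch of one way the "roll up and back" idea could work, under stated assumptions. It assumes max pooling over regions and frames, a linear event score, and hypothetical names (`roll_up`, `roll_back`, `concept_weights`) that do not come from the paper. Rolling up aggregates region-level object scores into a video-level histogram; rolling back follows the stored indices to find the frame and region that produced each pooled value.

```python
from typing import List, Tuple

# Hypothetical layout: region_scores[frame][region][concept] is an
# object-detector confidence. This layout is an assumption for illustration.
RegionScores = List[List[List[float]]]


def roll_up(region_scores: RegionScores) -> Tuple[List[float], List[Tuple[int, int]]]:
    """Max-pool region scores into one video-level object histogram.

    Also records, per concept, the (frame, region) that supplied the maximum,
    so the pooling can later be rolled back to a location.
    """
    n_concepts = len(region_scores[0][0])
    video_hist = [0.0] * n_concepts
    source = [(-1, -1)] * n_concepts  # (-1, -1) means no positive score seen
    for f, regions in enumerate(region_scores):
        for r, scores in enumerate(regions):
            for c, s in enumerate(scores):
                if s > video_hist[c]:
                    video_hist[c] = s
                    source[c] = (f, r)
    return video_hist, source


def roll_back(video_hist: List[float],
              source: List[Tuple[int, int]],
              concept_weights: List[float],
              top_k: int = 2) -> List[Tuple[int, int, int, float]]:
    """Return the top_k concepts by contribution to a linear event score,
    each as (concept, frame, region, contribution)."""
    contrib = [w * h for w, h in zip(concept_weights, video_hist)]
    ranked = sorted(range(len(contrib)), key=lambda c: contrib[c], reverse=True)
    return [(c, source[c][0], source[c][1], contrib[c]) for c in ranked[:top_k]]


# Toy input: 2 frames, 2 regions per frame, 3 object concepts (made-up values).
scores = [
    [[0.1, 0.7, 0.0], [0.2, 0.1, 0.3]],
    [[0.9, 0.2, 0.1], [0.0, 0.4, 0.8]],
]
weights = [1.0, 0.5, 2.0]  # hypothetical event-specific concept weights

hist, src = roll_up(scores)
evidence = roll_back(hist, src, weights)
event_score = sum(w * h for w, h in zip(weights, hist))
```

In this toy run, the concept with the largest weighted contribution is traced back to frame 1, region 1, which illustrates how a single pooled representation could serve both detection (the event score) and evidence localization (the stored positions).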

© 2016 The Institute of Image Information and Television Engineers