ITE Transactions on Media Technology and Applications
Online ISSN : 2186-7364
ISSN-L : 2186-7364
Special Section on Image and Video Analysis, Search, and Benchmark
[Invited Paper] Object Pooling for Multimedia Event Detection and Evidence Localization
Hao ZhangChong-Wah Ngo
Author information
JOURNAL FREE ACCESS

2016 Volume 4 Issue 3 Pages 218-226

Details
Abstract
Multimedia event detection (MED) and evidence hunting are two primary topics in the area of multimedia event search. The former serves to retrieve a list of relevant videos given an event query, whereas, the latter reasons why and how much the degree a retrieved video answers that query. Common practices deal with these two topics in separate methods, however, in this paper, we combine MED and evidence hunting into a joint framework. We propose a refined semantical representation named object pooling which can dynamically extract visual snippets corresponding to the location of when and where evidences might appear. The main idea of object pooling is to adaptively sample regions from frames for generation of object histogram that can be efficiently rolled up and back. Experiments conducted on large-scale TRECVID MED 2014 dataset demonstrate the effectiveness of proposed object pooling approach on both event detection and evidence hunting.
Content from these authors
© 2016 The Institute of Image Information and Television Engineers
Previous article Next article
feedback
Top