ITE Transactions on Media Technology and Applications
Online ISSN : 2186-7364
ISSN-L : 2186-7364
Special Section on Image and Video Analysis, Search, and Benchmark
[Invited Paper] Semantic Indexing for Large-Scale Video Retrieval
Nakamasa InoueKoichi Shinoda
著者情報
ジャーナル フリー

2016 年 4 巻 3 号 p. 209-217

詳細
抄録
Video semantic indexing, which aims to detect objects, actions and scenes from video data, is one of important research topics in multimedia information processing. In the Text Retrieval Conference Video Retrieval Evaluation (TRECVID) workshop, many fundamental techniques for video processing have been developed and have been shown to be effective for real data such as Internet videos. They include extensions of deep learning techniques and image recognition techniques such as bag of visual words to video data. This paper reviews TRECVID activities with these techniques for semantic indexing. We also show the TokyoTech system using Gaussian-mixture-model (GMM) supervectors and deep convolutional neural networks (CNNs) with its experimental evaluation at TRECVID 2014.
著者関連情報
© 2016 The Institute of Image Information and Television Engineers
前の記事 次の記事
feedback
Top