映像情報メディア学会冬季大会講演予稿集
Online ISSN : 2424-2306
Print ISSN : 1343-4357
ISSN-L : 1343-4357
2017 ITE Winter Annual Convention
セッションID: 13B-3
会議情報

A Statistical Study of Late Fusion Strategy in Two-stream ConvNets for Human Action Recognition
*Jianfeng XUKazuyuki TASAKAHiromasa YANAGIHARA
著者情報
会議録・要旨集 フリー

詳細
抄録
Serving as the base of many advanced networks for human action recognition in videos, two-stream ConvNets have shown strong performance on a commonly used dataset, UCF101. This paper statistically analyzes the late fusion strategy in two-stream ConvNets from a frame level to a video level. We report the characteristics of the temporal domain on a frame level, which is called as a domino-like effect and explains well why an effective temporal fusion is difficult to design. For the fusion of two streams, it has been reported that a proper weight can improve performance substantially. However, here we will provide a different reason from the original report and propose a method for calculating an effective weight.
著者関連情報
© 2017 The Institute of Image Information and Television Engineers
前の記事 次の記事
feedback
Top