Recent Advances in Video Action Recognition with 3D Convolutions

Kensho HARA

doi:10.1587/transfun.2020IMP0012

この記事には本公開記事があります。本公開記事を参照してください。
引用する場合も本公開記事を引用してください。

Recent Advances in Video Action Recognition with 3D Convolutions

Kensho HARA

著者情報

キーワード: Video recognition, Action recognition, 3D convolutions, Survey

ジャーナルフリー早期公開

論文ID: 2020IMP0012

DOI https://doi.org/10.1587/transfun.2020IMP0012

この記事には本公開記事があります。

The final version of this article is now available: Vol. E104.A (2021), No. 6 pp. 846-856

詳細

抄録

The performance of video action recognition has improved significantly in recent decades. Current recognition approaches mainly utilize convolutional neural networks to acquire video feature representations. In addition to the spatial information of video frames, temporal information such as motions and changes is important for recognizing videos. Therefore, the use of convolutions in a spatiotemporal three-dimensional (3D) space for representing spatiotemporal features has garnered significant attention. Herein, we introduce recent advances in 3D convolutions for video action recognition.

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）