Transactions of the Japanese Society for Artificial Intelligence
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Original Paper
Initialization of CNN Models for Training on a Small Dataset Using Importance of Filter Parameters
Jien Kato, Guanwen Zhang, Yu Wang

2017 Volume 32 Issue 3 Pages C-G42_1-11

Abstract

Deep Convolutional Neural Networks (CNNs) have achieved great success in many computer vision tasks. However, it is still difficult to use them in practical tasks, especially small-scale tasks, because of the large quantity of labeled training data required in their training process. In this paper, we present two approaches that enable easy adaptation of CNNs to small-scale tasks: the Minimum Entropy Loss (MEL) approach and the Minimum Reconstruction Error (MRE) approach. The basic idea of these two approaches is to select informative filters from pre-trained CNN models and reuse them to initialize CNNs designed for small-scale tasks. Unlike the popular fine-tuning approach, which also reuses pre-trained CNNs by conducting further training without changing their model architectures, MEL and MRE allow pre-trained models to be reused easily within novel model architectures. This provides high flexibility when dealing with small-scale tasks. We evaluated the two approaches on practical small-scale tasks and confirmed their high performance and high flexibility.
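The sketch below illustrates the general idea described in the abstract, not the authors' exact MEL or MRE procedure: score each filter of a pre-trained convolutional layer by an importance measure, keep the highest-scoring filters, and copy them into a smaller layer of a new model. The entropy-of-responses score, the layer sizes, and the probe data are all placeholder assumptions for illustration.

```python
# Minimal sketch (assumed simplification, not the paper's exact criterion):
# select informative filters from a pre-trained conv layer and use them to
# initialize a smaller conv layer in a new model architecture.
import torch
import torch.nn as nn

def filter_response_entropy(conv: nn.Conv2d, images: torch.Tensor, bins: int = 32) -> torch.Tensor:
    """Estimate an entropy score per filter from a histogram of its responses."""
    with torch.no_grad():
        responses = conv(images)                      # (N, C_out, H, W)
    scores = []
    for c in range(responses.shape[1]):
        r = responses[:, c].flatten()
        hist = torch.histc(r, bins=bins)
        p = hist / hist.sum().clamp(min=1e-8)
        p = p[p > 0]
        scores.append(-(p * p.log()).sum())
    return torch.stack(scores)                        # one score per filter

# Pre-trained ("source") layer with 64 filters; the target layer of the
# smaller model designed for the small-scale task has only 16.
source = nn.Conv2d(3, 64, kernel_size=3, padding=1)
target = nn.Conv2d(3, 16, kernel_size=3, padding=1)

# A small unlabeled probe batch from the target task is used to score filters.
probe_images = torch.randn(8, 3, 32, 32)

scores = filter_response_entropy(source, probe_images)
keep = torch.topk(scores, k=target.out_channels).indices   # most informative filters

# Initialize the target layer with the selected pre-trained filters.
with torch.no_grad():
    target.weight.copy_(source.weight[keep])
    target.bias.copy_(source.bias[keep])
```

After this initialization, the new model can be trained on the small labeled dataset as usual; the reused filters serve only as a starting point, which is what distinguishes this scheme from fine-tuning a fixed architecture.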

© The Japanese Society for Artificial Intelligence 2017