A Proposed Meta-Learning Method for Multi-modal Few-shot One-class Image Classification

大熊 拓海; 中山 英樹

doi:10.11517/pjsai.JSAI2021.0_4I1GS7b02

Abstract

One-class image classification is the task of determining whether an image belongs to a given class, and it is important for recognizing specific visual concepts. Humans solve this task well from only a few examples, whereas existing few-shot learning methods perform far worse than humans. To improve performance, we propose the ``Multi-modal Belongingness Network (MMBeNet)'', an extension of the ``Belongingness Network (BeNet)'' \cite{BeNet} that uses not only a few images but also semantic information such as attributes and word vectors. We call this task ``multi-modal few-shot one-class image classification''. We hypothesize that semantic information is an important factor in humans' strong ability on this task, and we confirm through experiments that it improves performance. In addition, a single MMBeNet model can handle not only multi-modal tasks but also image-only few-shot tasks and zero-shot tasks.
