Host: The Japanese Society for Artificial Intelligence
Name : The 35th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 35
Location : [in Japanese]
Date : June 08, 2021 - June 11, 2021
One-class image classification is the task of deciding whether an image belongs to a given class, and it is important for recognizing specific visual concepts. Humans solve this task well from only a few examples, whereas previous few-shot learning methods perform far worse than humans. To narrow this gap, we propose the ``Multi-modal Belongingness Network (MMBeNet)'', an extension of the ``Belongingness Network (BeNet) \cite{BeNet}'' that uses not only a few images but also semantic information such as attributes and word vectors. We call this task ``multi-modal few-shot one-class image classification''. We hypothesize that semantic information is an important factor in humans' strong performance, and we confirm through experiments that it improves performance on this task. In addition, a single MMBeNet model can handle not only multi-modal tasks but also image-only few-shot and zero-shot tasks.