Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
35th (2021)
Session ID : 4I1-GS-7b-02
Conference information

Meta-learning Method for Multi-modal Few-shot One-class Image Classification
*Takumi OHKUMAHideki NAKAYAMA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

One-class image classification is the task of discriminating whether images belong to a certain class, and this task is important for the recognition of certain visual concepts. Human is good at solving this task with only a few data, and the performance of few-shot learning methods of previous works is much less than that of human. To improve the performance, we propose ``Multi-modal Belongingness Network (MMBeNet)'', which is an extended model of ``Belongingness Network (MMBeNet) \cite{BeNet}'', to use not only a few image data but also semantic information such as attributes and word vector, and call this task ``multi-modal few-shot one-class image classification’’. We consider that semantic information is an important factor of the high ability of humans and confirm that it is effective for this task to improve the performance through experiments. Besides, MMBeNet can solve not only multi-modal tasks but also image-only few-shot and zero-shot tasks by a single model.

Content from these authors
© 2021 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top