Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
39th (2025)
Session ID : 1F4-OS-40b-05
Conference information

Zero-shot Visual Concept Blending without Text Guidance
*Hiroya MAKINOTakahiro YAMAGUCHIHiroyuki SAKAI
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In this study, we propose a novel image generation technique called “Visual Concept Blending”, which extracts features that are either shared or distinct features from multiple reference images and transfers them to a base image. By using multiple reference images, our method provides fine control over which features are blended into the base image. At the same time, it leverages the CLIP embedding space to facilitate the transfer of higher-level concepts such as shape transformation and motion. The proposed approach directly utilizes existing pre-trained models without requiring additional training, making it both simple and robust. This technique is expected to find wide-ranging applications in fields like art and design.

Content from these authors
© 2025 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top