Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 3F1-GS-10-05
Conference information

Towards Automatic Generation of Graphic Layout with Large Multimodal Models
*Limin WANGSatoshi WAKIToyotaro SUZUMURA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Given the recent advancement of generative models, it has become possible that AI instead of humans generates graphic layouts. Among existing methods for layout generation, some utilize not only the information of each element but also constraints such as the relationships between elements. However, these methods often require humans to specify the constraints, which can be burdensome. Additionally, they have the limitation of only considering the category information of layout elements like “image”, “text”, “title”, and so on, without taking into account the detailed content within those images or text. Thus, this study proposes a method that leverages the detailed content of elements and automatically generate constraints that will be used for layout generation. Since elements can be either images or text, we explore the use of large multimodal models for extracting detailed content. This approach leads to the automatic generation of graphic layouts with less need for extensive human input.

Content from these authors
© 2024 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top