Stable Diffusion に基づく視覚・言語融合における3DCG生成画像の品質評価に関する基礎的検討

河畑 則文

doi:10.11371/wiieej.23.04.0_288

Reports of the 308th Technical Conference of the Institute of Image Electronics Engineers of Japan

Session ID : 23-04-085

DOI https://doi.org/10.11371/wiieej.23.04.0_288

Conference information

Host: The Institute of Image Electronics Engineers of Japan

Co-host: The Institute of Image Information and Television Engineers, The Society for Art & Science, The Computer Graphic Arts Society(CG-ARTS)

Name : Reports of the 308th Technical Conference of the Institute of Image Electronics Engineers of Japan

Number : 308

Location : [in Japanese]

Date : March 05, 2024 -

A Fundamental Study on 3D CG Image Quality Assessment in Vision & Language Based on Stable Diffusion

*Norifumi KAWABATA

Author information

Keywords: Image Generation AI, Diffusion Model, Vision and Language, image-to-image, Image Quality Assessment

CONFERENCE PROCEEDINGS RESTRICTED ACCESS

Details

Abstract

GPT-4, which is a multimodal large-scale language model, was released on March 14, 2023. GPT-4 is equipped with Transformer, a machine learning model for natural language processing, which trains a large neural network through unsupervised learning, followed by reinforcement learning from human feedback (RLHF) based on human feedback. Although GPT-4 is one of the research achievements in the field of natural language processing (NLP), it is a technology that can be applied not only to natural language generation but also to image generation. However, specifications for GPT-4 have not been made public, therefore it is difficult to use for research purposes. In this study, we first generated an image database by adjusting parameters using Stable Diffusion, which is a deep learning model that is also used for image generation based on text input and images. And then, we carried out experiments to evaluate the image quality from the generated database, and discussed the quality assessment of the image generation model.

Corresponding author

Register with J-STAGE for free!