ファッションコーディネートの説明文生成における人間の評価と相関する評価関数の探索

藤崎 勇哉; 岸本 将志; 田崎 夏代; 都筑 友昭

doi:10.11517/pjsai.JSAI2024.0_2T5OS5b04

Abstract

To apply Large Language Models (LLMs) in the real world, it is crucial that the text they generate is of value to humans and of a quality that is acceptable to humans. This study aims to find evaluation functions that correlate with human evaluations of fashion coordination descriptions generated by LLMs. Identifying such evaluation functions could allow for the improvement of the accuracy of fashion coordination description generation models in a direction aligned with human values, and potentially automate the entire process from description generation to evaluation. In this research, fashion coordination descriptions generated by LLMs were evaluated by skilled fashion stylists, and a dataset was created based on their evaluation. Using this dataset, we sought to find evaluation metrics that correlate with human evaluations. The candidates for these functions were functions used in the abstractive summarization task.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!