Automatic Evaluation and Analysis of Image Captioning Models Based on Scene Graphs

Reo Tanaka; Yuiga Wada; Komei Sugiura

doi:10.11517/pjsai.JSAI2023.0_4G2OS24c04

37th (2023)

Session ID : 4G2-OS-24c-04

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 37th Annual Conference of the Japanese Society for Artificial Intelligence

Number : 37

Location : [in Japanese]

Date : June 06, 2023 - June 09, 2023

Automatic Evaluation and Analysis of Image Captioning Models Based on Scene Graphs

*Reo TANAKA, Yuiga WADA, Komei SUGIURA

Keywords: Image captioning, Automatic evaluation metric, Scene graph, JaSPICE

CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

Image captioning studies rely heavily on automatic evaluation metrics such as BLEU and METEOR, which are based on n-grams. However, these metrics correlate poorly with human evaluations, which has led to the proposal of alternative metrics such as JaSPICE. So far, JaSPICE has been validated only on a general image captioning task, and without an error analysis. In this paper, we analyze JaSPICE on a fetching instruction generation task and identify its errors on an image captioning task. We conducted experiments on the STAIR Captions and PFN-PIC datasets, in which JaSPICE outperformed the baseline metrics in terms of the correlation coefficient with human evaluation.
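The comparison described in the abstract, correlating a metric's scores with human ratings, can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the abstract does not say which correlation coefficient was used, so both Pearson and Spearman are shown as assumptions, and the function names and the score lists are hypothetical placeholders rather than data from the paper.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Hypothetical per-caption values, for illustration only (not from the paper).
metric_scores = [0.12, 0.45, 0.30, 0.80, 0.55]  # scores from some automatic metric
human_ratings = [1, 3, 2, 5, 4]                 # human judgments of the same captions

print("Pearson :", pearson(metric_scores, human_ratings))
print("Spearman:", spearman(metric_scores, human_ratings))
```

Under this setup, a metric is said to outperform a baseline when its scores yield a higher correlation with the same set of human ratings.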
