Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 1O4-GS-7-03
Conference information

Predicate Classification Using Optimal Transport Loss in Scene Graph Generation
*Sorachi KURITASatoshi OYAMAItsuki NODA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

We propose a method to generate scene graphs using optimal transport loss as a measure to compare two probability distributions. In scene graph generation, learning with cross-entropy loss leads to biased predictions because the distribution of predicate labels in the dataset has severe imbalance. We apply learning with the optimal transport loss, which easily reflects similarity between labels as transportation cost, to the predicate classification in scene graph generation. In the proposed method, the transportation cost of the optimal transport is defined using the similarity of words obtained from the pre-trained model. The experimental evaluation of the effectiveness shows that the method achieves better performance than existing models in terms of mean Recall@50 and mean Recall@100. Furthermore, it can improve recall of predicate labels that are scarce in the dataset.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top