Host: The Japanese Society for Artificial Intelligence
Name : The 38th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 38
Location : [in Japanese]
Date : May 28, 2024 - May 31, 2024
Despite significant research efforts to integrate human judgment to improve model interpretability, there is a continued need to enhance the efficiency of evaluation algorithms in this domain. It's important to note that human perceptions may not consistently align with dataset labels. Therefore, we developed a topic model architecture to address this discrepancy. While topic modeling is commonly associated with language models, we introduced a contrastive topic modeling approach on clustering results of human-annotated images. Semi-supervised clustering incorporates must-link constraints for similar items and cannot-link constraints for dissimilar items, which are provided by humans. Our method aligns image patches clustering with the similarity measurement between prototypes and dataset samples in the model during training. It ensures that the deep neural network, while predicting images, transfers human knowledge from a multi-semantic topic derived from the clustering result to individual samples. This process generates intrinsic global topic explanations, illuminating salient image features and capturing both positive and negative relations. Our experimental results achieve highly competitive outcomes and signify direct visual concept examples for ease of understanding.