Host: The Japanese Society for Artificial Intelligence
Name : The 38th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 38
Location : [in Japanese]
Date : May 28, 2024 - May 31, 2024
Detecting the actions of each object is detrimental to improving the usability of the model, but the risk of misrecognition increases as the number of label combinations increases. Therefore, we propose a framework that reduces the amount of misrecognition by utilizing the requirements that the set of labels has to satisfy. Specifically, we propose MOD<sub>YOLO</sub>, a novel multilabel object detection model built upon the state-of-the-art object detection model YOLOv8, and develop our framework on top of it. We then assess the framework's effectiveness by applying it to the ROAD-R Challenge for NeurIPS 2023 competition. For Task 1, we introduce the Corrector Model and Blender Model, two new models that follow after the object detection process, aiming to generate a more constrained output. For Task 2, constrained losses have been incorporated into the training process of MOD<sub>YOLO</sub> using Fuzzy Logic. The results show that using the above framework was instrumental to improving the scores for both Tasks 1 and 2, allowing us to place third and first in the subsequent tasks.