Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 2M1-OS-11a-03
Conference information

Introducing Constraints to Multilabel Object Detection and application to ROAD-R
*Sota MORIYAMAKoji WATANABEKatsumi INOUEAkihiro TAKEMURA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Detecting the actions of each object is detrimental to improving the usability of the model, but the risk of misrecognition increases as the number of label combinations increases. Therefore, we propose a framework that reduces the amount of misrecognition by utilizing the requirements that the set of labels has to satisfy. Specifically, we propose MOD<sub>YOLO</sub>, a novel multilabel object detection model built upon the state-of-the-art object detection model YOLOv8, and develop our framework on top of it. We then assess the framework's effectiveness by applying it to the ROAD-R Challenge for NeurIPS 2023 competition. For Task 1, we introduce the Corrector Model and Blender Model, two new models that follow after the object detection process, aiming to generate a more constrained output. For Task 2, constrained losses have been incorporated into the training process of MOD<sub>YOLO</sub> using Fuzzy Logic. The results show that using the above framework was instrumental to improving the scores for both Tasks 1 and 2, allowing us to place third and first in the subsequent tasks.

Content from these authors
© 2024 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top