マルチラベル物体認識への制約知識の導入とROAD-Rへの適用

森山 総太; 渡邉 晃司; 井上 克巳; 竹村 彰浩

doi:10.11517/pjsai.JSAI2024.0_2M1OS11a03

38th (2024)

Session ID : 2M1-OS-11a-03

DOI https://doi.org/10.11517/pjsai.JSAI2024.0_2M1OS11a03

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 38th Annual Conference of the Japanese Society for Artificial Intelligence

Number : 38

Location : [in Japanese]

Date : May 28, 2024 - May 31, 2024

Introducing Constraints to Multilabel Object Detection and application to ROAD-R

*Sota MORIYAMA, Koji WATANABE, Katsumi INOUE, Akihiro TAKEMURA

Author information

Keywords: Object Recognition, Boolean Satisfiability Problem, Constrains, Autonomous Driving

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

Detecting the actions of each object is detrimental to improving the usability of the model, but the risk of misrecognition increases as the number of label combinations increases. Therefore, we propose a framework that reduces the amount of misrecognition by utilizing the requirements that the set of labels has to satisfy. Specifically, we propose MOD<sub>YOLO</sub>, a novel multilabel object detection model built upon the state-of-the-art object detection model YOLOv8, and develop our framework on top of it. We then assess the framework's effectiveness by applying it to the ROAD-R Challenge for NeurIPS 2023 competition. For Task 1, we introduce the Corrector Model and Blender Model, two new models that follow after the object detection process, aiming to generate a more constrained output. For Task 2, constrained losses have been incorporated into the training process of MOD<sub>YOLO</sub> using Fuzzy Logic. The results show that using the above framework was instrumental to improving the scores for both Tasks 1 and 2, allowing us to place third and first in the subsequent tasks.

Corresponding author

Conference information

Register with J-STAGE for free!