教師なし深度補完ネットワークを使用したセンサーフュージョンに基づく物体検出フレームワーク

羅 敏杰; 楊 波; 中野 公彦

doi:10.11188/seisankenkyu.76.75

Abstract

In this paper, a novel perception framework is presented for 2D and 3D object detection, based on sensor fusion of cameras and Li-DAR. While camera images provide abundant environmental features, they lack depth information. Conversely, Li-DAR point clouds offer accurate depth information, which however, are sparse in nature. Recognizing the complementary nature of each sensor’s strengths and weaknesses, an unsupervised depth completion network to enrich information from both sensors is used. This enhanced data is then utilized for performing 2D and 3D object detection tasks using a state-of-the-art detection network. The proposed framework is validated on KITTI data set, and experimental results demonstrate notable improvements in both 2D and 3D tasks when compared to baseline results.

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Register with J-STAGE for free!