視覚言語モデルを用いた検索手法による交通シーン検索の効率化

露木 雅文; 新 吉高; 銭 智定

doi:10.11351/jsaeronbun.55.1139

Abstract

Retrieving relevant traffic scene data from existing database is essential in the development of advanced driver-assistance systems but such task is time consuming and computationally expensive. This study proposes a traffic scene retrieval system that utilizes a vision-language model and clustering techniques. The proposed system is capable of executing data retrieval task by inputting an image data or text as a search query. Evaluation results showed that the system was able to retrieve complex scene data(e.g., traffic congestion) from a driving video database under 3 seconds. Overall, the results indicate that the prosed system is feasible for practical applications.

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Register with J-STAGE for free!