Anomaly Detection of Expressway Attachments and Vegetation from In-Vehicle Camera Footage Using In-Context Learning with a Multimodal Large Language Model

太齊 蓮; 李 想; 五箇 亮太; 斉藤 直輝; 前田 圭介; 鎌田 文幸; 久保 竜志; 川嵜 裕二; 小川 貴弘; 長谷山 美紀

doi:10.11532/jsceiii.6.3_393

Abstract

Japan’s declining birthrate and aging population have produced an ongoing shortage of skilled engineers, making AI-assisted inspection an urgent priority in expressway maintenance. Conventional AI-based approaches typically build a dedicated model for each inspection target, such as expressway attachments or vegetation. Because anomaly types vary widely, developing and maintaining a separate model for each case imposes significant practical limitations. In this study, we apply in-context learning with a multimodal large language model to in-vehicle camera footage, so that a single model can detect multiple types of expressway anomalies, including those involving expressway attachments and vegetation. We evaluate the effectiveness of the method through experiments on real-world in-vehicle footage provided by East Nippon Expressway Company Limited.
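To make the idea of in-context learning concrete, the following is a minimal sketch of how a few labeled frames might be placed ahead of a query frame in a single prompt, so that one model can handle several anomaly types without retraining. It is an illustration only and not the authors' implementation: the `LabeledFrame` type, the `build_prompt` function, the message dictionary layout, the file paths, and the category names are all hypothetical, and no real model API is called.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LabeledFrame:
    """A reference frame with a known answer (hypothetical structure)."""
    image_path: str  # hypothetical path to a frame from in-vehicle footage
    label: str       # one of the assumed category names


def build_prompt(examples, query_image_path, categories):
    """Assemble an interleaved image/text prompt for few-shot prompting.

    The returned list is a generic, assumed message layout; a real
    multimodal model would need it converted to its own input format.
    """
    instruction = (
        "You inspect expressway frames from an in-vehicle camera. "
        "Answer with exactly one of: " + ", ".join(categories) + "."
    )
    messages = [{"type": "text", "text": instruction}]
    # Each in-context example is an image followed by its known answer.
    for example in examples:
        messages.append({"type": "image", "path": example.image_path})
        messages.append({"type": "text", "text": f"Answer: {example.label}"})
    # The query frame comes last, with the answer left open for the model.
    messages.append({"type": "image", "path": query_image_path})
    messages.append({"type": "text", "text": "Answer:"})
    return messages


# Assumed category names, chosen only for illustration.
CATEGORIES = ["normal", "attachment anomaly", "vegetation anomaly"]

EXAMPLES = [
    LabeledFrame("frames/ref_001.jpg", "normal"),
    LabeledFrame("frames/ref_002.jpg", "attachment anomaly"),
    LabeledFrame("frames/ref_003.jpg", "vegetation anomaly"),
]

PROMPT = build_prompt(EXAMPLES, "frames/query.jpg", CATEGORIES)
```

In this sketch, covering a new anomaly type would amount to adding a category name and a labeled reference frame rather than training another dedicated model, which is the practical appeal of the single-model approach described above.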
