Detailed Analysis and Attempted Improvement of Machine Learning Methods for First-Person Videos Toward Practical Use

竹内 太法; 関 喜史; 佐藤 可直

doi:10.11517/pjsai.JSAI2023.0_1O5GS704

Abstract

This study applies Ego-Exo, an existing machine learning method, to first-person videos and analyzes the experimental results in detail. Machine learning research on first-person videos has grown in recent years, but few detailed analyses of prediction model outputs have been published, so the knowledge needed for practical application is lacking. Our analysis suggests two findings. First, label prediction performance depends on the number of samples per label: labels with many samples are predicted well. Second, prediction performance is high for labels that correspond to obvious actions and objects, and low for other labels. These findings are important for building datasets for domain-specific tasks.
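
The first finding in the abstract (per-label performance tracking per-label sample count) could be checked on one's own predictions by tabulating the two side by side. The following is a minimal sketch, assuming single-label ground truth and predictions held in plain Python lists. The function name `per_label_stats` and the toy labels are hypothetical; they are not taken from the study or from Ego-Exo, and recall is used here only as one possible per-label metric.

```python
from collections import defaultdict


def per_label_stats(y_true, y_pred):
    """Return {label: (n_samples, recall)} for single-label predictions."""
    total = defaultdict(int)    # number of ground-truth samples per label
    correct = defaultdict(int)  # number of correctly predicted samples per label
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return {label: (n, correct[label] / n) for label, n in total.items()}


# Toy, made-up labels purely for illustration (not data from the study).
y_true = ["cut", "cut", "cut", "cut", "stir", "stir", "peel"]
y_pred = ["cut", "cut", "cut", "stir", "stir", "cut", "cut"]

stats = per_label_stats(y_true, y_pred)

# List labels from most to fewest samples so any trend is easy to see.
for label, (n, recall) in sorted(stats.items(), key=lambda kv: -kv[1][0]):
    print(f"{label:5s} n={n} recall={recall:.2f}")
```

Sorting by sample count makes it easy to see whether rarely occurring labels are also the poorly predicted ones, which is the kind of pattern the abstract reports.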
