Host: The Japanese Society for Artificial Intelligence
Name : The 37th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 37
Location : [in Japanese]
Date : June 06, 2023 - June 09, 2023
In this study, we aim to apply machine learning techniques to first-person videos and perform a detailed analysis of the experimental results using the existing method, Ego-Exo. In recent years, machine learning research on first-person videos has become popular. However, detailed analysis of the output of prediction models has not been published much, and knowledge for practical application is lacking. The results of the analysis suggest two findings. Firstly, the performance of label prediction depends on the number of samples of each label. We found that labels with a large number of samples have high prediction performance. Secondly, label prediction performance is high for obvious actions and objects, and low for other labels. These findings are important for building datasets for domain-specific tasks.