2024 Volume 8 Issue 3 Pages 115-119
Automatically extracting information such as text data and illustrations from images in digital archives and providing them to users is an approach that has been attracting attention in terms of full-text search support and accessibility improvement in conjunction with increasingly sophisticated machine learning. The National Diet Library (NDL) has a demonstration site called NDL Lab. It has implemented and released experimental functions of information extraction methods based on machine learning and has reflected the findings and user responses obtained in the development process of the NDL Digital Collections and other products. This paper describes the information extraction technology and introduces the findings obtained through the operation of experimental services at the NDL Lab.