Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 3Xin2-103

Dataset Development of Vision-Language Model for Patent Data
*Kazuya ANDO, Tsukito MIZOGUCHI, Haruki ISHIKAWA, Akira IYODA, Seiya KAWANO, Koichiro YOSHINO, Hirofumi NONAKA
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

In this study, we developed a dataset for training vision-language models on text-drawing pairs from patent documents. Specifically, we built a large image-text dataset by mapping each patent drawing to its explanatory text, using the standardized expressions with which patents describe their drawings.
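
The abstract does not state which standardized expressions were used or how the mapping was implemented. As a minimal sketch of the general idea, the Python snippet below assumes English-style drawing descriptions of the form "FIG. N is ..." with one statement per line; the pattern `FIG_STATEMENT`, the verb list, and the function `pair_drawings_with_text` are hypothetical names chosen for illustration, not the authors' method.

```python
import re

# Hypothetical pattern (an assumption, not taken from the source):
# one drawing description per line, e.g. "FIG. 2 shows a cross section."
FIG_STATEMENT = re.compile(
    r"^[ \t]*FIG\.\s*(?P<num>\d+[A-Z]?)\s+"
    r"(?:is|shows|illustrates|depicts)\s+"
    r"(?P<desc>.+?)\.?[ \t]*$",
    re.IGNORECASE | re.MULTILINE,
)


def pair_drawings_with_text(description: str) -> dict[str, str]:
    """Map each figure number to the text that describes that drawing."""
    pairs: dict[str, str] = {}
    for match in FIG_STATEMENT.finditer(description):
        # Keep the first description found for a given figure number.
        pairs.setdefault(match.group("num").upper(), match.group("desc").strip())
    return pairs


sample = (
    "FIG. 1 is a perspective view of the device.\n"
    "FIG. 2 shows a cross section taken along line A-A in FIG. 1.\n"
    "The device includes a housing.\n"
)
pairs = pair_drawings_with_text(sample)
```

Each key in `pairs` could then be joined with the corresponding drawing image file to form one image-text pair. Lines that do not follow the assumed pattern, such as the last line of `sample`, are skipped.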

© 2024 The Japanese Society for Artificial Intelligence