Host: The Japanese Society for Artificial Intelligence
Name : 34th Annual Conference, 2020
Number : 34
Location : Online
Date : June 09, 2020 - June 12, 2020
Patent data is generally useful for companies to develop their business strategies based on technology trends. Although patent similarity estimation is a critical step for analyzing technology trends, previous methods relied primarily on unsupervised sentence vectorization in which manual laboring (e.g., thesaurus definition) was needed. Here we introduce a new approach for obtaining embedded vectors of patent claim sentences based on recurrent neural network. The network is trained by a newly developed task to discriminate whether a claim-pair is similar or not. We demonstrate that the discrimination task can be solved by the network with high accuracy, and that the patent technology map created by claim-sentence vectors derived from the trained model clearly separates multiple technology fields included in the patent dataset of interest, without manually elaborating thesaurus and stop-word lists.