人工知能学会全国大会論文集
Online ISSN : 2758-7347
38th (2024)
セッションID: 2Q5-IS-1-01
会議情報

Analysis of Tender Documents Using Sequence Labeling with LLM-based Improver
*Tomoki ITOHiroki SAKAJI
著者情報
会議録・要旨集 フリー

詳細
抄録

Bidders often take a long time to read and understand tender documents because they require specialized knowledge, and tender documents are generally long. Here, the function that can extract specific items (i.e., item extractor) and the function that can highlight words or phrases related to specific items (i.e., word-phrase highlighter) are in great demand. To develop such type of functions, we need to solve two problems. The first problem is the problem related to the annotated data set. The second problem concerns the BERT-based sequence labeling approach in a small training dataset setting. To solve the first problem, we created two types of sequence labeling datasets related to Item Extractor and Word-Phrase Highlighter. To solve the second problem, we propose the Information Extraction (IE) method, which combines (1) a supervised learning approach using BERT-based sequence labeling and (2) a large language model (LLM)-based improver. Experimental evaluation demonstrates the effectivenes of our approach. Moreover, as an application, We then developed the web application system called Tender Document Analyzer (TDDA).

著者関連情報
© 2024 The Japanese Society for Artificial Intelligence
前の記事 次の記事
feedback
Top