Proposal of a Method for Generating and Tuning Definition Statements for Binary Classification of Financial Text Using Large Language Models

高野 海斗; 中川 慧; 藤本 悠吾

doi:10.11517/jsaisigtwo.2024.FIN-033_155

Abstract

Text classification with large language models (LLMs) often has low interpretability and is difficult to adjust manually. Zero-shot classification, in which an LLM classifies text according to a definition statement, is more interpretable, but writing good definition statements is challenging. Therefore, we propose a method that uses LLMs to automatically generate definition statements, improving both classification accuracy and interpretability in binary text classification. The proposed method first randomly splits the labeled data and generates initial definition statements from sampled data. It then classifies the labeled data using these statements and updates them by feeding the misclassified data back into the LLM, repeating this process to refine the definition statements. Experiments with real-world texts show that the proposed method performs well compared with a fine-tuned BERT model and LLM few-shot learning, and that it produces appropriate definition statements.
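
The loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the prompts, the sample size, or the stopping rule, so `refine_definition`, its parameters, and the three callables are hypothetical. In practice `generate`, `classify`, and `update` would each wrap an LLM call; here they are replaced by toy keyword-based stand-ins so that the example runs without any model.

```python
import random
from typing import Callable, List, Tuple

Example = Tuple[str, int]  # (text, label), label is 0 or 1


def refine_definition(
    data: List[Example],
    generate: Callable[[List[Example]], str],
    classify: Callable[[str, str], int],
    update: Callable[[str, List[Example]], str],
    sample_size: int = 2,   # assumed value, not from the paper
    max_rounds: int = 5,    # assumed value, not from the paper
    seed: int = 0,
) -> Tuple[str, float]:
    """Return the best definition statement found and its accuracy on `data`."""
    if not data:
        raise ValueError("data must not be empty")

    # Randomly sample labeled examples and build an initial definition.
    rng = random.Random(seed)
    shuffled = data[:]
    rng.shuffle(shuffled)
    definition = generate(shuffled[:sample_size])

    best_definition, best_acc = definition, -1.0
    for round_idx in range(max_rounds + 1):
        # Classify every labeled example with the current definition.
        errors = [(t, y) for t, y in data if classify(definition, t) != y]
        acc = 1 - len(errors) / len(data)
        if acc > best_acc:
            best_definition, best_acc = definition, acc
        if not errors or round_idx == max_rounds:
            break
        # Feed the misclassified examples back to revise the definition.
        definition = update(definition, errors)
    return best_definition, best_acc


# Toy stand-ins for the LLM calls: a "definition" is a comma-separated
# keyword list, and a text is positive if it contains any keyword.
def toy_generate(examples: List[Example]) -> str:
    return ",".join(t.split()[0] for t, y in examples if y == 1)


def toy_classify(definition: str, text: str) -> int:
    keywords = [k for k in definition.split(",") if k]
    return int(any(k in text for k in keywords))


def toy_update(definition: str, errors: List[Example]) -> str:
    keywords = [k for k in definition.split(",") if k]
    for text, label in errors:
        first = text.split()[0]
        if label == 1 and first not in keywords:
            keywords.append(first)  # cover a missed positive example
    return ",".join(keywords)


toy_data = [
    ("profit rose sharply", 1),
    ("revenue increased", 1),
    ("office moved downtown", 0),
    ("new logo unveiled", 0),
]
final_definition, final_acc = refine_definition(
    toy_data, toy_generate, toy_classify, toy_update
)
```

Keeping the best-scoring definition rather than the last one is a design choice of this sketch, made because an update is not guaranteed to improve accuracy. The abstract does not say how the final statement is selected.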
