NTCIR-17 QA Lab-PoliInfo-4におけるQAタスクおよびAVタスクの分析

小川 泰弘; 石川 晴基; 秋葉 友良

doi:10.11517/pjsai.JSAI2024.0_3M1OS12a01

Abstract

This study aims to analyze what errors occur in QA and summarization systems using large language models and whether such errors can be detected. The data are the results of the Question Answering (QA) and Answer Verification (AV) tasks of the NTCIR-17 QA Lab-PoliInfo-4 using assembly minutes. The QA task is a task to output a summary of the corresponding answer to the input question summary in the assembly minutes, and we analyzed the errors in the results. The AV task is a task to judge whether the QA task's output is correct, and we analyzed what kind of output is misjudged.

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Conference information

Register with J-STAGE for free!