Host: The Japanese Society for Artificial Intelligence
Name : The 103rd SIG-SLUD
Number : 103
Location : [in Japanese]
Date : March 20, 2025 - March 22, 2025
Pages : 80-85
This study examines potential applications of large language models (LLMs) in language education. We designed a task to evaluate their ability to judge the compatibility between grammatical items and example sentences, and conducted experiments with multiple LLMs, comparing their performance on accuracy, false negative (FN) rate, false positive (FP) rate, and Balanced Score. We also confirmed that synthetic data can serve as a practical alternative. Future research should focus on developing methods for generating high-quality synthetic data and on expanding their applicability. The findings are expected to contribute to establishing benchmarks for evaluating the grammatical competence of LLMs in natural language.
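The abstract does not define its metrics, so the following is a minimal sketch of how such scores are typically computed from a confusion matrix. The definition of "Balanced Score" here is an assumption (the mean of the true negative and true positive rates, i.e. balanced accuracy); the paper may define it differently.

```python
def evaluation_scores(tp: int, fn: int, fp: int, tn: int):
    """Compute accuracy, FN rate, FP rate, and an assumed Balanced Score
    from confusion-matrix counts for a binary compatibility judgment:
    positive = "grammatical item and example sentence are compatible"."""
    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total
    fn_rate = fn / (tp + fn)  # compatible pairs judged incompatible (miss rate)
    fp_rate = fp / (fp + tn)  # incompatible pairs judged compatible (false alarm)
    # Assumed definition: average of the two per-class success rates
    balanced_score = 1 - (fn_rate + fp_rate) / 2
    return accuracy, fn_rate, fp_rate, balanced_score


# Hypothetical counts for illustration only
acc, fnr, fpr, bal = evaluation_scores(tp=40, fn=10, fp=5, tn=45)
print(acc, fnr, fpr, bal)  # 0.85 0.2 0.1 0.85
```

Reporting both FN and FP rates alongside accuracy, as the study does, guards against a model that achieves high accuracy simply by favoring one class when the two classes are imbalanced.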