Abstract
In this study, we tasked ChatGPT 4.0 with predicting the semantic categories of basic vocabulary nouns extracted from the “Nihongo kyoiku goihyo (Japanese Language Education Vocabulary List).” The gold-standard labels were taken from the “Bunruigoihyo (Word List by Semantic Principles) (ver.1.0.1),” and we statistically analyzed the accuracy and reliability of the predictions. The analysis showed that ChatGPT's predictions agree substantially with the semantic categories at the “section” level of the “Bunruigoihyo (Word List by Semantic Principles)” (mean κ = .706), but that nouns related to “Shutai (agents)” and “Katsudo (activities)” show a notable tendency to be misclassified. These findings suggest that generative AI, as represented by ChatGPT, can serve as an important tool for quantitative research, provided it is used with caution.