Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
39th (2025)
Session ID : 2Win5-23

Analysis of Internal Representations of Knowledge with Expressions of Familiarity
*Kenshiro TANAKA, Yoshihiro SAKAI, Yufeng ZHAO, Naoya INOUE, Kai SATO, Ryosuke TAKAHASHI, Benjamin HEINZERLING, Kentaro INUI
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

Research on the ability of large language models (LLMs) to judge the familiarity of knowledge is progressing. However, little attention has been given to whether LLMs can assess the familiarity of knowledge when the training data includes linguistic expressions such as "It is known that...". This study investigates how familiarity is internally represented in LLMs. To this end, we trained models on descriptions of knowledge accompanied by linguistic expressions indicating familiarity, and then analyzed the models' internal representations. The findings reveal that (1) familiarity information is retained in the internal representations of knowledge separately for the linguistic expressions provided during training, and (2) familiarity information is maintained separately for each position of the linguistic expressions. This study provides a foundation for understanding the mechanisms underlying LLMs' ability to judge familiarity.

© 2025 The Japanese Society for Artificial Intelligence