Host: The Japanese Society for Artificial Intelligence
Name: The 38th Annual Conference of the Japanese Society for Artificial Intelligence
Number: 38
Location: [in Japanese]
Date: May 28, 2024 - May 31, 2024
One of the major goals of Artificial Intelligence research is to construct machine learning models that comprehend semantics as humans do. While Large Language Models (LLMs) have substantially improved performance on semantic-comprehension benchmarks, how LLMs’ internal representations encode semantic information, and how closely those representations resemble ones in the human brain, remains poorly understood. This study aims to elucidate these mechanisms by examining the correspondence between human brain activity during semantic comprehension and the latent representations of LLMs. We collected human brain activity with functional magnetic resonance imaging (fMRI) while participants watched drama series. We also collected annotations of the dramas at multiple semantic levels, such as speech, objects, and stories, and extracted the corresponding latent representations from LLMs. We demonstrate that, especially for higher-level semantic content, the latent representations of LLMs explain human brain activity more accurately than those of traditional language models. Additionally, we show that distinct brain regions correspond to different LLM latent representations derived from the different levels of semantic content.
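As an illustration of the kind of analysis described above, the sketch below shows a voxelwise encoding model on synthetic data: stimulus features (standing in for LLM latent representations time-locked to the drama) are regressed onto simulated fMRI responses, and each voxel is scored by the correlation between predicted and held-out activity. The abstract does not specify the regression method; closed-form ridge regression is assumed here, and all array names, dimensions, and data are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins: in a real analysis, X would hold LLM latent
# representations aligned to fMRI time points (TRs), and Y would hold
# preprocessed voxel responses. Both are synthetic here.
n_trs, n_feats, n_voxels = 200, 32, 10
X = rng.standard_normal((n_trs, n_feats))
W_true = rng.standard_normal((n_feats, n_voxels))   # simulated ground truth
Y = X @ W_true + 0.5 * rng.standard_normal((n_trs, n_voxels))

# Split along time into train and test segments.
X_tr, X_te = X[:150], X[150:]
Y_tr, Y_te = Y[:150], Y[150:]

# Closed-form ridge regression: W = (X^T X + alpha * I)^{-1} X^T Y
alpha = 1.0
W = np.linalg.solve(X_tr.T @ X_tr + alpha * np.eye(n_feats), X_tr.T @ Y_tr)

# Score each voxel by the correlation between predicted and observed
# held-out responses; higher values mean the features better explain
# that voxel's activity.
Y_hat = X_te @ W
r = np.array([np.corrcoef(Y_hat[:, v], Y_te[:, v])[0, 1]
              for v in range(n_voxels)])
print("mean prediction correlation:", r.mean())
```

In practice, comparing per-voxel prediction correlations obtained from different feature spaces (e.g., LLM representations of speech vs. story-level annotations) is what allows one to ask which brain regions correspond to which level of semantic content.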