強化学習と汎用エージェント

相澤 彰子

doi:10.11517/jsaisigtwo.2017.AGI-006_05

Abstract

With the recent advancements and developments of deep learning techniques, 'reinforcement learning,' a framework based on the interaction between an agent and the environment, attracts a great deal of attention. This presentation introduces a universal agent model called AIXI (AIξ) proposed by Marcus Hutter (references 1,2). The AIXI model is based on the algorithmic information theory founded by Ray Solomonoff and uses a universal prior distribution in the agent optimization strategy. Based on this formulation, the AIXI can be interpreted as an agent model that can take an optimal strategy under any circumstances. The formulation of such universal agent is an attempt to answer the fundamental question of universal intelligence and may give some hints on how to deepening the reinforcement learning.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!