SICE Annual Conference Program and Abstracts
SICE Annual Conference 2002

Reinforcement Learning with Expectation and Action Augmented States in Partially Observable Environment
Sherwin A. Guirnaldo, Keigo Watanabe, Kiyotaka Izumi, Kazuo Kiguchi

p. 175

Abstract
The problem of developing good or optimal policies for partially observable Markov decision processes (POMDPs) remains one of the most alluring areas of research in artificial intelligence. Encouraged by the way we (humans) form expectations from past experiences, and by how our decisions and behaviour are shaped by those expectations, this paper proposes a method called expectation and action augmented states (EAAS) in reinforcement learning, aimed at discovering good or near-optimal policies in partially observable environments. The method uses the concept of expectation to distinguish between aliased states. It works by augmenting the agent's observation with its expectation of that observation. Two problems from the literature were used to test the proposed method. The results show promising characteristics of the method compared with some methods currently used in this domain.
© 2002 SICE