SICE Annual Conference Program and Abstracts
SICE Annual Conference 2002
会議情報

Performance of LQ-learning in POMDP Environments
Haeyeon LeeHiroyuki KamayaKenich Abe
著者情報
会議録・要旨集 フリー

p. 174

詳細
抄録
In this paper, we propose a new type of LQ-learning to solve POMDP. In the POMDP environment, the agent cannot observe the environment directly. In the LQ-learning, in order to dicriminate partially observed states, the agent attaches label to each observation which perceived as the same ones. Unlike our previous LQ-learning, we make preparations of knowledge about the environment in advance. The knowledge is automatically acquired by Kohenen’s Self-Organizing Map (SOM), which provides the knowledge about state transitions to the agent. Then, LQ-learning agent attaches labels to observations with reference to a map obtained by SOM.
著者関連情報
© 2002 SICE
前の記事 次の記事
feedback
Top