鎖型状態行動学習の提案

野津 亮; 小森 祐希; 本多 克宏; 市橋 秀友; 岩元 優希

doi:10.14864/fss.28.0_225

28th Fuzzy System Symposium

DOI https://doi.org/10.14864/fss.28.0_225

Conference information

Host: Japan Society for Fuzzy Theory and Intelligent Informatics (SOFT)

main

Proposal of Chain Form Reinforcement Learning

NOTSU Akira, KOMORI YUKI, HONDA Katsuhiro, ICHIHASHI Hidetomo, IWAMOTO Yuki

Author information

Keywords: Reinforcement learning, Q-learning, State-Action set categorization

CONFERENCE PROCEEDINGS OPEN ACCESS

Pages 225-228

Details

Abstract

We propose Chain Form Reinforcement Learning for a reinforcement learning agent. In the real world, learning is difficult because there are an infinite number of states and actions that need a large number of stored memories and learning times. To solve a problem, estimated values are categorized as"GOOD" or "NO GOOD" in the reinforcement learning process. Additionally, the alignment sequence of estimated values is changed because they are regardedas an important sequence themselves. We conducted some simulations and observed the influence of our methods.

Corresponding author

Register with J-STAGE for free!