Deep Reinforcement Learning Based on a Value Function Incorporating an Image Prediction Model

加藤 誉基; 西片 智広; 山内 悠嗣

doi:10.2493/jjspe.91.518

Abstract

Reinforcement learning is a machine learning method in which an agent learns its behavior through interaction with an environment, without labeled training data. The agent learns to perform the required actions by maximizing the value, which represents the expected reward over a certain period of time. Obtaining a high value requires selecting the optimal action in future states that are not yet known. If such a future state can be predicted in advance, the agent can choose better actions and, as a result, obtain a higher value. In this study, we use a deep learning-based future image generation model to predict unknown future states in advance. Predicting the future state makes it possible to select actions that lead to a higher value, so higher rewards can be expected at an early stage.
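
The idea of choosing actions by looking at a predicted future state can be illustrated with a minimal Python sketch. The abstract does not say how the predicted images are combined with the value function, so the one-step lookahead below is only an assumption, and every name in it (`select_action_by_lookahead`, `toy_predict_next`, `toy_value`) is hypothetical. A 1-D position stands in for the image state, and a hand-written function stands in for the learned image generation model.

```python
from typing import Callable, Sequence, TypeVar

State = TypeVar("State")


def select_action_by_lookahead(
    state: State,
    actions: Sequence[int],
    predict_next: Callable[[State, int], State],
    value_fn: Callable[[State], float],
) -> int:
    """Return the action whose predicted next state has the highest value.

    Assumption: a single-step lookahead. The paper's actual procedure
    may differ (for example, multi-step prediction).
    """
    return max(actions, key=lambda a: value_fn(predict_next(state, a)))


# Hypothetical stand-in for the future image generation model:
# the "state" is a 1-D position rather than an image.
def toy_predict_next(position: int, action: int) -> int:
    # Action 0 moves one step left, action 1 moves one step right.
    return position + (1 if action == 1 else -1)


# Hypothetical stand-in for the learned value function:
# positions closer to the goal at 5 have higher value.
def toy_value(position: int) -> float:
    return -abs(5 - position)


best = select_action_by_lookahead(2, [0, 1], toy_predict_next, toy_value)
print(best)
```

In this toy setting the agent starts at position 2, the predictor says that moving right reaches position 3, and the value function rates position 3 above position 1, so the agent moves right. In an image-based task, `predict_next` would presumably be replaced by the trained generation model and `value_fn` by the learned value network.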
