SEISAN KENKYU
Online ISSN : 1881-2058
Print ISSN : 0037-105X
ISSN-L : 0037-105X
Research Flash
New Parameter Optimizing Techniques for POMDP Model Design in Reinforcement Learning
Takaki MAKINOYasushi ODAKazuyuki AIHARA
Author information
JOURNAL FREE ACCESS

2013 Volume 65 Issue 3 Pages 315-318

Details
Abstract

One difficulty of model designing problems on the partially observable Markov decision process (POMDP), such as apprenticeship learning, was its high calculation cost for solving the optimal policy of many POMDP problems. In this paper, we propose two techniques that reduce the calculation cost within such a setting, that is, transfer learning and subgradient calculation. We show that both techniques can be efficiently implemented on a policy-iteration POMDP solver.

Content from these authors
© 2013 Institute of Industrial Science The University of Tokyo
Previous article Next article
feedback
Top