2013 Volume 65 Issue 3 Pages 315-318
A key difficulty in model-design problems on partially observable Markov decision processes (POMDPs), such as apprenticeship learning, is the high computational cost of solving for the optimal policies of many POMDP instances. In this paper, we propose two techniques that reduce this cost in such settings: transfer learning and subgradient calculation. We show that both techniques can be implemented efficiently on a policy-iteration POMDP solver.