A UNIFORM TWO-ARMED BANDIT PROBLEM WITH ONE ARM KNOWN REVISITED

Dieter Kalin; Radu Theodorescu

doi:10.11329/jjss1970.20.159

Abstract

Several decision problems such as bandit problems can be considered as special sequential two-action Markov decision models as described in [2]. In this paper a uniform two-armed bandit problem with one arm known is studied by embedding it in the general framework developed in [2]. Two cases of this problem are examined. The first case assumes that one end point of the uniformity interval of the unknown arm is Pareto distributed. In the second case the joint distribution of the two end points of the uniformity interval of the unknown arm is bilateral Pareto. The results obtained extend and complete those obtained in [6, 7].

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Register with J-STAGE for free!