Abstract
We study new types of dynamic allocation problems the Halting Bandit models. As an application, we obtain new proofs for the classic Gittins index decomposition result compare Gittins (Journal of the Royal Statistical Society, Series B, 1979, 41, 148–177), and recent results of the authors in Cowan and Katehakis (Probability in the Engineering and Informational Sciences, 2015, 29, 51–76).
Original language | English (US) |
---|---|
Pages (from-to) | 639-652 |
Number of pages | 14 |
Journal | Naval Research Logistics |
Volume | 70 |
Issue number | 7 |
DOIs | |
State | Published - Oct 2023 |
ASJC Scopus subject areas
- Modeling and Simulation
- Ocean Engineering
- Management Science and Operations Research
Keywords
- Markovian decision processes
- adaptive systems
- autonomous reasoning and learning
- dynamic data driven systems
- machine learning