On the asymptotic optimality of greedy index heuristics for multi-action restless bandits

Hodge, David and Glazebrook, Kevin (2015) On the asymptotic optimality of greedy index heuristics for multi-action restless bandits. Advances in Applied Probability, 47 (3). pp. 652-667. ISSN 0001-8678

Preview

PDF (2014_HGFinal_Cut)
2014_HGFinal_Cut.pdf - Submitted Version
Download (254kB)

Abstract

The class of restless bandits as proposed by Whittle (1988) have long been known to be intractable. This paper presents an optimality result which extends that of Weber and Weiss (1990) for restless bandits to a more general setting in which individual bandits have multiple levels of activation but are subject to an overall resource constraint. The contribution is motivated by the recent works of Glazebrook et al. (2011a), (2011b) who discussed the performance of index heuristics for resource allocation in such systems. Hitherto, index heuristics have been shown, under a condition of full indexability, to be optimal for a natural Lagrangian relaxation of such problems in which a resource is purchased rather than constrained. We find that under key assumptions about the nature of solutions to a deterministic differential equation that the index heuristics above are asymptotically optimal in a sense described by Whittle. We then demonstrate that these assumptions always hold for three-state bandits.

Item Type:

Journal Article

Journal or Publication Title:

Advances in Applied Probability

Uncontrolled Keywords:

/dk/atira/pure/subjectarea/asjc/2600/2604

Subjects:

?? applied mathematicsstatistics and probability ??

Departments:

Lancaster University Management School > Management Science

ID Code:

71018

Deposited By:

ep_importer_pure

Deposited On:

30 Sep 2014 14:55

Refereed?:

Yes

Published?:

Published

Last Modified:

11 Dec 2025 00:14

URI:

https://eprints.lancs.ac.uk/id/eprint/71018