Learning incentivization strategy for resource rebalancing in shared services with a budget constraint

Shen Shyang Ho, Matthew Schofield, Ning Wang

Research output: Contribution to journalArticlepeer-review

Abstract

In this paper, we describe the problem of learning an optimal incentivization strategy that maximizes the service level given a fixed budget constraint for a sharing service such as bike-sharing, car-sharing, etc. in a spatiotemporal environment. The service level can be affected due to an imbalance in supply and demand at different locations during a specific time period. We describe and present our study and comparison of various reinforcement learning algorithms on a 1-D problem setting in a simulated bike-share system with a budget constraint on the incentives. We empirically study the performance of three policy gradient based reinforcement learning algorithms, namely: Proximal Policy Optimization (PPO), Trust Region Policy Optimization (TRPO), and Actor Critic using Kronecker-Factored Trust Region (ACKTR).

Original languageEnglish (US)
Pages (from-to)105-114
Number of pages10
JournalJournal of Applied and Numerical Optimization
Volume3
Issue number1
DOIs
StatePublished - Apr 2021

All Science Journal Classification (ASJC) codes

  • Computational Mathematics
  • Control and Optimization
  • Numerical Analysis
  • Modeling and Simulation

Fingerprint

Dive into the research topics of 'Learning incentivization strategy for resource rebalancing in shared services with a budget constraint'. Together they form a unique fingerprint.

Cite this