TY - JOUR
T1 - Solving Large-Scale Dynamic Vehicle Routing Problems with Stochastic Requests
AU - Zhang, Jian
AU - Luo, Kelin
AU - Florio, Alexandre M.
AU - van Woensel, Tom
PY - 2023/4/16
Y1 - 2023/4/16
N2 - Dynamic vehicle routing problems (DVRPs) arise in several applications such as technician routing, meal delivery, and parcel shipping. We consider the DVRP with stochastic customer requests (DVRPSR), in which vehicles must be routed dynamically with the goal of maximizing the number of served requests. We model the DVRPSR as a multi-stage optimization problem, where the first-stage decision defines route plans for serving scheduled requests. Our main contributions are knapsack-based linear models to approximate accurately the expected reward-to-go, measured as the number of accepted requests, at any state of the stochastic system. These approximations are based on representing each vehicle as a knapsack with a capacity given by the remaining service time available along the vehicle’s route. We combine these approximations with optimal acceptance and assignment decision rules and derive efficient and high-performing online scheduling policies. We further leverage good predictions of the expected reward-to-go to design initial route plans that facilitate serving dynamic requests. Computational experiments on very large instances based on a real street network demonstrate the effectiveness of the proposed methods in prescribing high-quality offline route plans and online scheduling decisions.
AB - Dynamic vehicle routing problems (DVRPs) arise in several applications such as technician routing, meal delivery, and parcel shipping. We consider the DVRP with stochastic customer requests (DVRPSR), in which vehicles must be routed dynamically with the goal of maximizing the number of served requests. We model the DVRPSR as a multi-stage optimization problem, where the first-stage decision defines route plans for serving scheduled requests. Our main contributions are knapsack-based linear models to approximate accurately the expected reward-to-go, measured as the number of accepted requests, at any state of the stochastic system. These approximations are based on representing each vehicle as a knapsack with a capacity given by the remaining service time available along the vehicle’s route. We combine these approximations with optimal acceptance and assignment decision rules and derive efficient and high-performing online scheduling policies. We further leverage good predictions of the expected reward-to-go to design initial route plans that facilitate serving dynamic requests. Computational experiments on very large instances based on a real street network demonstrate the effectiveness of the proposed methods in prescribing high-quality offline route plans and online scheduling decisions.
KW - Approximate dynamic programming
KW - Column generation
KW - Markov decision processes
KW - Routing
KW - Stochastic requests
UR - http://www.scopus.com/inward/record.url?scp=85135418150&partnerID=8YFLogxK
U2 - 10.1016/j.ejor.2022.07.015
DO - 10.1016/j.ejor.2022.07.015
M3 - Article
SN - 0377-2217
VL - 306
SP - 596
EP - 614
JO - European Journal of Operational Research
JF - European Journal of Operational Research
IS - 2
ER -