Skip to main navigation Skip to search Skip to main content

A reinforcement learning approach for the dynamic vehicle routing and scheduling problem with stochastic request times and time-dependent, stochastic travel times

Research output: Contribution to journalArticleAcademicpeer-review

51 Downloads (Pure)

Abstract

The dynamic vehicle routing and scheduling problem with stochastic request times and time-dependent stochastic travel times poses significant challenges for vehicle fleets that must adapt dynamically to uncertain and fluctuating customer demands and traffic conditions. To effectively address these challenges, we propose an end-to-end approach that integrates routing and scheduling decisions to optimize overall performance. Specifically, our approach combines reinforcement learning with a hybrid attention mechanism comprising a Reverse LSTM and a multi-pointer decoder to optimize vehicle routes and departure times simultaneously. The numerical study highlights the considerable efficiency improvements of the proposed approach compared to existing dynamic approaches, especially the practical experiments on a road network in Vienna. Furthermore, our generalized training strategy significantly improves model adaptability. It reduces computational overhead, underscoring the promising potential of the approach for practical deployment in dynamic and complex real-world logistics operations.
Original languageEnglish
Article number105387
Number of pages21
JournalTransportation Research. Part C: Emerging Technologies
Volume182
DOIs
Publication statusPublished - Jan 2026

Funding

This work has utilized resources and expertise provided by the SURF Experimental Technologies Platform, which is part of the SURF cooperative in the Netherlands (No. EINF-9576). Dawei Chen acknowledges financial support from the China Scholarship Council (No. 202106370026).

Keywords

  • Dynamic vehicle routing and scheduling problem
  • Reinforcement learning
  • Stochastic and time-dependent travel time
  • Stochastic request time

Fingerprint

Dive into the research topics of 'A reinforcement learning approach for the dynamic vehicle routing and scheduling problem with stochastic request times and time-dependent, stochastic travel times'. Together they form a unique fingerprint.

Cite this