Samenvatting
Resource allocation plays a critical role in minimizing cycle time and improving the efficiency of business processes. Recently, Deep Reinforcement Learning (DRL) has emerged as a powerful technique to optimize resource allocation policies in business processes. In the DRL framework, an agent learns a policy through interaction with the environment, guided solely by reward signals that indicate the quality of its decisions. However, existing algorithms are not suitable for dynamic environments such as business processes. Furthermore, existing DRL-based methods rely on engineered reward functions that approximate the desired objective, but a misalignment between reward and objective can lead to undesired decisions or suboptimal policies. To address these issues, we propose a rollout-based DRL algorithm and a reward function to optimize the objective directly. Our algorithm iteratively improves the policy by evaluating execution trajectories following different actions. Our reward function directly decomposes the objective function of minimizing the cycle time, such that trial-and-error reward engineering becomes unnecessary. We evaluated our method in six scenarios, for which the optimal policy can be computed, and on a set of increasingly complex, realistically sized process models. The results show that our algorithm can learn the optimal policy for the scenarios and outperform or match the best heuristics on the realistically sized business processes.
| Originele taal-2 | Engels |
|---|---|
| Titel | Business Process Management Forum |
| Subtitel | BPM 2025 Forum, Seville, Spain, August 31 – September 5, 2025, Proceedings |
| Redacteuren | Arik Senderovich, Cristina Cabanillas, Irene Vanderfeesten, Hajo A. Reijers |
| Plaats van productie | Cham |
| Uitgeverij | Springer |
| Pagina's | 256-273 |
| Aantal pagina's | 18 |
| ISBN van elektronische versie | 978-3-032-02929-4 |
| ISBN van geprinte versie | 978-3-032-02928-7 |
| DOI's | |
| Status | Gepubliceerd - 27 aug. 2025 |
| Evenement | BPM Forum held at the 23rd International Conference on Business Process Management, BPM 2025 - Seville, Spanje Duur: 31 aug. 2025 → 5 sep. 2025 |
Publicatie series
| Naam | Lecture Notes in Business Information Processing (LNBIP) |
|---|---|
| Volume | 564 |
| ISSN van geprinte versie | 1865-1348 |
| ISSN van elektronische versie | 1865-1356 |
Congres
| Congres | BPM Forum held at the 23rd International Conference on Business Process Management, BPM 2025 |
|---|---|
| Land/Regio | Spanje |
| Stad | Seville |
| Periode | 31/08/25 → 5/09/25 |
Bibliografische nota
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
Vingerafdruk
Duik in de onderzoeksthema's van 'A Rollout-Based Algorithm and Reward Function for Resource Allocation in Business Processes'. Samen vormen ze een unieke vingerafdruk.Citeer dit
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver