A Rollout-Based Algorithm and Reward Function for Resource Allocation in Business Processes

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Resource allocation plays a critical role in minimizing cycle time and improving the efficiency of business processes. Recently, Deep Reinforcement Learning (DRL) has emerged as a powerful technique to optimize resource allocation policies in business processes. In the DRL framework, an agent learns a policy through interaction with the environment, guided solely by reward signals that indicate the quality of its decisions. However, existing algorithms are not suitable for dynamic environments such as business processes. Furthermore, existing DRL-based methods rely on engineered reward functions that approximate the desired objective, but a misalignment between reward and objective can lead to undesired decisions or suboptimal policies. To address these issues, we propose a rollout-based DRL algorithm and a reward function to optimize the objective directly. Our algorithm iteratively improves the policy by evaluating execution trajectories following different actions. Our reward function directly decomposes the objective function of minimizing the cycle time, such that trial-and-error reward engineering becomes unnecessary. We evaluated our method in six scenarios, for which the optimal policy can be computed, and on a set of increasingly complex, realistically sized process models. The results show that our algorithm can learn the optimal policy for the scenarios and outperform or match the best heuristics on the realistically sized business processes.

Original languageEnglish
Title of host publicationBusiness Process Management Forum
Subtitle of host publicationBPM 2025 Forum, Seville, Spain, August 31 – September 5, 2025, Proceedings
EditorsArik Senderovich, Cristina Cabanillas, Irene Vanderfeesten, Hajo A. Reijers
Place of PublicationCham
PublisherSpringer
Pages256-273
Number of pages18
ISBN (Electronic)978-3-032-02929-4
ISBN (Print)978-3-032-02928-7
DOIs
Publication statusPublished - 27 Aug 2025
EventBPM Forum held at the 23rd International Conference on Business Process Management, BPM 2025 - Seville, Spain
Duration: 31 Aug 20255 Sept 2025

Publication series

NameLecture Notes in Business Information Processing (LNBIP)
Volume564
ISSN (Print)1865-1348
ISSN (Electronic)1865-1356

Conference

ConferenceBPM Forum held at the 23rd International Conference on Business Process Management, BPM 2025
Country/TerritorySpain
CitySeville
Period31/08/255/09/25

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.

Keywords

  • Business process optimization
  • Deep Reinforcement Learning
  • Resource allocation
  • Reward function
  • Rollouts

Fingerprint

Dive into the research topics of 'A Rollout-Based Algorithm and Reward Function for Resource Allocation in Business Processes'. Together they form a unique fingerprint.

Cite this