Deep Reinforcement Learning for Two-Sided Online Bipartite Matching in Collaborative Order Picking

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

186 Downloads (Pure)

Abstract

As a growing number of warehouse operators are moving from human-only to Collaborative human-robot Order Picking solutions, more efficient picker routing policies are needed, since the complexity of coordinating multiple actors in the system increases significantly. The objective of these policies is to match human pickers and robot carriers to fulfill picking tasks, optimizing pick-rate and total tardiness of the orders. In this paper, we propose to formulate the order picking routing problem as a more general combinatorial optimization problem known as Two-sided Online Bipartite Matching. We present an end-to-end Deep Reinforcement Learning approach to optimize a combination of pick-rate and order tardiness, and to deal with the uncertainty of real-world warehouse environments. To extract and exploit spatial information from the environment, we devise three different Graph Neural Network architectures and empirically evaluate them on several scenarios of growing complexity in a simulation environment we developed. We show that all proposed methods significantly outperform greedy and more sophisticated heuristics, as well as non-GNN-based DRL approaches. Moreover, our methods exhibit good transferability properties, even when scaling up test problem instances to more than forty times the size of the ones the models were trained on.
Original languageEnglish
Title of host publicationProceedings of the 15th Asian Conference on Machine Learning, ACML2023
EditorsBerrin Yanıkoğlu, Wray Buntine
PublisherPMLR
Pages121-136
Number of pages16
Publication statusPublished - 2024
Event15th Asian Conference on Machine Learning - Istanbul, Turkey
Duration: 11 Nov 202314 Nov 2023

Publication series

NameProceedings of Machine Learning Research (PMLR)
Volume222
ISSN (Electronic)2640-3498

Conference

Conference15th Asian Conference on Machine Learning
Abbreviated titleACML2023
Country/TerritoryTurkey
CityIstanbul
Period11/11/2314/11/23

Keywords

  • Deep Reinforcement Learning
  • Graph Neural Networks
  • Collaborative Order Picking
  • Online Bipartite Matching
  • Online Combinatorial Optimization

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning for Two-Sided Online Bipartite Matching in Collaborative Order Picking'. Together they form a unique fingerprint.

Cite this