Doorgaan naar hoofdnavigatie Doorgaan naar zoeken Ga verder naar hoofdinhoud

Preference-based Deep Reinforcement Learning for Historical Route Estimation

  • Boshen Pan
  • , Yaoxin Wu
  • , Zhiguang Cao
  • , Yaqing Hou (Corresponderende auteur)
  • , Guangyu Zou
  • , Qiang Zhang (Corresponderende auteur)

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

Samenvatting

Recent Deep Reinforcement Learning (DRL) techniques have advanced solutions to Vehicle Routing Problems (VRPs). However, many of these methods focus exclusively on optimizing distance-oriented objectives (i.e., minimizing route length), often overlooking the implicit drivers' preferences for routes. These preferences, which are crucial in practice, are challenging to model using traditional DRL approaches. To address this gap, we propose a preference-based DRL method characterized by its reward design and optimization objective, which is specialized to learn historical route preferences. Our experiments demonstrate that the method aligns generated solutions more closely with human preferences. Moreover, it exhibits strong generalization performance across a variety of instances, offering a robust solution for different VRP scenarios.

Originele taal-2Engels
TitelProceedings of the 34th International Joint Conference on Artificial Intelligence, IJCAI 2025
RedacteurenJames Kwok
UitgeverijInternational Joint Conferences on Artificial Intelligence (IJCAI)
Pagina's8591-8599
Aantal pagina's9
ISBN van elektronische versie9781956792065
DOI's
StatusGepubliceerd - 2025
Evenement34th Internationa Joint Conference on Artificial Intelligence, IJCAI 2025 - Montreal, Canada
Duur: 16 aug. 202522 aug. 2025

Congres

Congres34th Internationa Joint Conference on Artificial Intelligence, IJCAI 2025
Land/RegioCanada
StadMontreal
Periode16/08/2522/08/25

Bibliografische nota

Publisher Copyright:
© 2025 International Joint Conferences on Artificial Intelligence. All rights reserved.

Vingerafdruk

Duik in de onderzoeksthema's van 'Preference-based Deep Reinforcement Learning for Historical Route Estimation'. Samen vormen ze een unieke vingerafdruk.

Citeer dit