Doorgaan naar hoofdnavigatie Doorgaan naar zoeken Ga verder naar hoofdinhoud

Large Language Models as End-to-end Combinatorial Optimization Solvers

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

12 Downloads (Pure)

Samenvatting

Combinatorial optimization (CO) problems, central to decision-making scenarios like logistics and manufacturing, are traditionally solved using problem-specific algorithms requiring significant domain expertise. While large language models (LLMs) have shown promise in automating CO problem solving, existing approaches rely on intermediate steps such as code generation or solver invocation, limiting their generality and accessibility. This paper introduces a novel framework that empowers LLMs to serve as end-to-end CO solvers by directly mapping natural language problem descriptions to solutions. We propose a two-stage training strategy: supervised fine-tuning (SFT) imparts LLMs with solution generation patterns from domain-specific solvers, while a feasibility-and-optimality-aware reinforcement learning (FOARL) process explicitly mitigates constraint violations and refines solution quality. Evaluation across seven NP-hard CO problems shows that our method achieves a high feasibility rate and reduces the average optimality gap to 1.03-8.20% by tuning a 7B-parameter LLM, surpassing both general-purpose LLMs (e.g., GPT-4o), reasoning models (e.g., DeepSeek-R1), and domain-specific heuristics. Our method establishes a unified language-based pipeline for CO without extensive code execution or manual architectural adjustments for different problems, offering a general and language-driven alternative to traditional solver design while maintaining relative feasibility guarantees.
Originele taal-2Engels
TitelThe 39th Annual Conference on Neural Information Processing Systems (NeurIPS 2025)
UitgeverijNeural information processing systems foundation
Aantal pagina's40
StatusGeaccepteerd/In druk - 2025
Evenement39th Annual Conference on Neural Information Processing Systems, NeurIPS 2025 - San Diego, Verenigde Staten van Amerika
Duur: 2 dec. 20257 dec. 2025

Congres

Congres39th Annual Conference on Neural Information Processing Systems, NeurIPS 2025
Land/RegioVerenigde Staten van Amerika
StadSan Diego
Periode2/12/257/12/25

Vingerafdruk

Duik in de onderzoeksthema's van 'Large Language Models as End-to-end Combinatorial Optimization Solvers'. Samen vormen ze een unieke vingerafdruk.

Citeer dit