Skip to main navigation Skip to search Skip to main content

Surrogate Model-Based Reinforcement Learning for Bidding Strategies in Local Flexibility Markets

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

6 Downloads (Pure)

Abstract

Reinforcement Learning (RL) has emerged as a promising tool for designing bidding strategies in electricity markets. Online RL learns policies by directly interacting with the environment, adapting in real time through trial-and-error feedback, while offline RL trains solely on a fixed, pre-collected dataset without any interaction with the environment. However, existing online RL methods assume detailed knowledge of the market-clearing model, including grid constraints, network parameters, and other participants' bids, which is unrealistic in a real-world setting. Additionally, the exploration process bears the risk of financial losses for market participants. Purely offline RL, on the other hand, suffers from its reliance on historical data that may be limited and non-diverse; as a result, it cannot gather new feedback to explore unobserved scenarios, potentially leading to sub-optimal results. To address these limitations, this paper proposes a surrogate model-based RL approach. Firstly, a machine learning surrogate model is constructed to approximate the local flexibility market (LFM) clearing process, using historical bids and corresponding market outcomes of the market participant. The surrogate model predicts the profit of the market participant associated with each bid in terms of price and quantity, thus avoiding the need for the true market model. A twin delayed deep deterministic policy gradient (TD3) RL bidding agent is then trained on this learned surrogate LFM model, enabling systematic exploration with reduced financial risk. Comparative simulations in the case study with three approaches—online RL, offline RL, and the proposed surrogate-based RL—reveal that online RL achieves the highest profit. Offline RL yields the lowest profit, constrained by limited historical data. The surrogatebased RL strategy outperforms the offline approach, offering a practical balance between profitability, and exploration risk.
Original languageEnglish
Title of host publication2025 IEEE International Conference on Environment and Electrical Engineering and 2025 IEEE Industrial and Commercial Power Systems Europe (EEEIC / I&CPS Europe)
EditorsZbigniew Leonowicz, Erika Stracqualursi, Michal Jasinski
PublisherInstitute of Electrical and Electronics Engineers
Number of pages6
ISBN (Electronic)979-8-3315-9515-9
DOIs
Publication statusPublished - 30 Sept 2025
Event25th IEEE International Conference on Environment and Electrical Engineering, & 9th IEEE Industrial and Commercial Power Systems Europe, EEEIC / I&CPS Europe
- Chania, Greece
Duration: 15 Jul 202518 Jul 2025

Conference

Conference25th IEEE International Conference on Environment and Electrical Engineering, & 9th IEEE Industrial and Commercial Power Systems Europe, EEEIC / I&CPS Europe
Country/TerritoryGreece
CityChania
Period15/07/2518/07/25

Funding

This publication is part of the research program 'MEGAMIND - Enabling distributed operation of energy infrastructures through Measuring, Gathering, Mining and Integrating grid-edge Data', (partly) financed by the Dutch Research Council (NWO), through the Perspectief funding instrument under number P19-25.

FundersFunder number
Nederlandse Organisatie voor Wetenschappelijk OnderzoekP19-25

    UN SDGs

    This output contributes to the following UN Sustainable Development Goals (SDGs)

    1. SDG 7 - Affordable and Clean Energy
      SDG 7 Affordable and Clean Energy

    Keywords

    • Local Flexibility Market
    • Machine Learning
    • Reinforcement Learning
    • Strategic Bidding
    • Surrogate Model

    Fingerprint

    Dive into the research topics of 'Surrogate Model-Based Reinforcement Learning for Bidding Strategies in Local Flexibility Markets'. Together they form a unique fingerprint.

    Cite this