Abstract
Reinforcement Learning (RL) has emerged as a promising tool for designing bidding strategies in electricity markets. Online RL learns policies by directly interacting with the environment, adapting in real time through trial-and-error feedback, while offline RL trains solely on a fixed, pre-collected dataset without any interaction with the environment. However, existing online RL methods assume detailed knowledge of the market-clearing model, including grid constraints, network parameters, and other participants' bids, which is unrealistic in a real-world setting. Additionally, the exploration process bears the risk of financial losses for market participants. Purely offline RL, on the other hand, suffers from its reliance on historical data that may be limited and non-diverse; as a result, it cannot gather new feedback to explore unobserved scenarios, potentially leading to sub-optimal results. To address these limitations, this paper proposes a surrogate model-based RL approach. Firstly, a machine learning surrogate model is constructed to approximate the local flexibility market (LFM) clearing process, using historical bids and corresponding market outcomes of the market participant. The surrogate model predicts the profit of the market participant associated with each bid in terms of price and quantity, thus avoiding the need for the true market model. A twin delayed deep deterministic policy gradient (TD3) RL bidding agent is then trained on this learned surrogate LFM model, enabling systematic exploration with reduced financial risk. Comparative simulations in the case study with three approaches—online RL, offline RL, and the proposed surrogate-based RL—reveal that online RL achieves the highest profit. Offline RL yields the lowest profit, constrained by limited historical data. The surrogatebased RL strategy outperforms the offline approach, offering a practical balance between profitability, and exploration risk.
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE International Conference on Environment and Electrical Engineering and 2025 IEEE Industrial and Commercial Power Systems Europe (EEEIC / I&CPS Europe) |
| Editors | Zbigniew Leonowicz, Erika Stracqualursi, Michal Jasinski |
| Publisher | Institute of Electrical and Electronics Engineers |
| Number of pages | 6 |
| ISBN (Electronic) | 979-8-3315-9515-9 |
| DOIs | |
| Publication status | Published - 30 Sept 2025 |
| Event | 25th IEEE International Conference on Environment and Electrical Engineering, & 9th IEEE Industrial and Commercial Power Systems Europe, EEEIC / I&CPS Europe - Chania, Greece Duration: 15 Jul 2025 → 18 Jul 2025 |
Conference
| Conference | 25th IEEE International Conference on Environment and Electrical Engineering, & 9th IEEE Industrial and Commercial Power Systems Europe, EEEIC / I&CPS Europe |
|---|---|
| Country/Territory | Greece |
| City | Chania |
| Period | 15/07/25 → 18/07/25 |
Funding
This publication is part of the research program 'MEGAMIND - Enabling distributed operation of energy infrastructures through Measuring, Gathering, Mining and Integrating grid-edge Data', (partly) financed by the Dutch Research Council (NWO), through the Perspectief funding instrument under number P19-25.
| Funders | Funder number |
|---|---|
| Nederlandse Organisatie voor Wetenschappelijk Onderzoek | P19-25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- Local Flexibility Market
- Machine Learning
- Reinforcement Learning
- Strategic Bidding
- Surrogate Model
Fingerprint
Dive into the research topics of 'Surrogate Model-Based Reinforcement Learning for Bidding Strategies in Local Flexibility Markets'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver