Abstract
Given a partially observable Markov decision process (POMDP) with finite state, input and measurement spaces, and costly measurements and control, we consider the problem of when to sample and actuate. Both sampling and actuation are modeled as control actions in a framework encompassing estimation and intervention problems. The process evolves freely between two consecutive control action times. Control actions are assumed to reset the conditional distribution of the state given the measurements to one of a finite number of distributions. We tackle the problem of deciding when control actions should occur in order to minimize an average cost that penalizes states and the rate of control actions. The problem is first shown to boil down to a stopping time problem. While the latter can be solved optimally, the complexity of the optimal policy is intractable. Thus, we propose two approximate methods. The first is inspired by relaxed dynamic programming, and it is within an additive cost factor of the optimal policy. The second is inspired by consistent event-triggered control and ensures that the cost is smaller than that of periodic control for the same control rate. We conclude that the latter policy can deal with large dimensional problems, as demonstrated in the context of precision agriculture.
Original language | English |
---|---|
Title of host publication | 2022 IEEE 61st Conference on Decision and Control, CDC 2022 |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 2399-2404 |
Number of pages | 6 |
ISBN (Electronic) | 978-1-6654-6761-2 |
DOIs | |
Publication status | Published - 10 Jan 2023 |
Event | 61st IEEE Conference on Decision and Control, CDC 2022 - The Marriott Cancún Collection, Cancun, Mexico Duration: 6 Dec 2022 → 9 Dec 2022 Conference number: 61 https://cdc2022.ieeecss.org/ |
Conference
Conference | 61st IEEE Conference on Decision and Control, CDC 2022 |
---|---|
Abbreviated title | CDC 2022 |
Country/Territory | Mexico |
City | Cancun |
Period | 6/12/22 → 9/12/22 |
Internet address |
Bibliographical note
Funding Information:The authors are with the Control Systems Technology Group, Department of Mechanical Engineering, Eindhoven University of Technology, the Netherlands. E-mails:{d.antunes, r.m.beumer, w.p.m.h.heemels, m.j.g.v.d.molengraft}@tue.nl. This research is part of the research program SYNERGIA (project number 17626), which is partly financed by the Dutch Research Council (NWO).
Funding
The authors are with the Control Systems Technology Group, Department of Mechanical Engineering, Eindhoven University of Technology, the Netherlands. E-mails:{d.antunes, r.m.beumer, w.p.m.h.heemels, m.j.g.v.d.molengraft}@tue.nl. This research is part of the research program SYNERGIA (project number 17626), which is partly financed by the Dutch Research Council (NWO).