Temporal logic control of general Markov decision processes by approximate policy refinement

Sofie Haesaert, Sadegh Soudjani, Alessandro Abate

Onderzoeksoutput: Bijdrage aan tijdschriftTijdschriftartikelAcademicpeer review

2 Citaties (Scopus)

Uittreksel

The formal verification and controller synthesis for general Markov decision processes (gMDPs) that evolve over uncountable state spaces are computationally hard and thus generally rely on the use of approximate abstractions. In this paper, we contribute to the state of the art of control synthesis for temporal logic properties by computing and quantifying a less conservative gridding of the continuous state space of linear stochastic dynamic systems and by giving a new approach for control synthesis and verification that is robust to the incurred approximation errors. The approximation errors are expressed as both deviations in the outputs of the gMDPs and in the probabilistic transitions.

TaalEngels
Pagina's73-78
Aantal pagina's6
TijdschriftIFAC-PapersOnLine
Volume51
Nummer van het tijdschrift16
DOI's
StatusGepubliceerd - 1 jan 2018

Vingerafdruk

Temporal logic
Dynamical systems
Controllers

Citeer dit

Haesaert, Sofie ; Soudjani, Sadegh ; Abate, Alessandro. / Temporal logic control of general Markov decision processes by approximate policy refinement. In: IFAC-PapersOnLine. 2018 ; Vol. 51, Nr. 16. blz. 73-78
@article{e7cec9273efc414eb561f4c9b42ade6e,
title = "Temporal logic control of general Markov decision processes by approximate policy refinement",
abstract = "The formal verification and controller synthesis for general Markov decision processes (gMDPs) that evolve over uncountable state spaces are computationally hard and thus generally rely on the use of approximate abstractions. In this paper, we contribute to the state of the art of control synthesis for temporal logic properties by computing and quantifying a less conservative gridding of the continuous state space of linear stochastic dynamic systems and by giving a new approach for control synthesis and verification that is robust to the incurred approximation errors. The approximation errors are expressed as both deviations in the outputs of the gMDPs and in the probabilistic transitions.",
author = "Sofie Haesaert and Sadegh Soudjani and Alessandro Abate",
year = "2018",
month = "1",
day = "1",
doi = "10.1016/j.ifacol.2018.08.013",
language = "English",
volume = "51",
pages = "73--78",
journal = "IFAC-PapersOnLine",
issn = "2405-8963",
publisher = "Elsevier",
number = "16",

}

Temporal logic control of general Markov decision processes by approximate policy refinement. / Haesaert, Sofie; Soudjani, Sadegh; Abate, Alessandro.

In: IFAC-PapersOnLine, Vol. 51, Nr. 16, 01.01.2018, blz. 73-78.

Onderzoeksoutput: Bijdrage aan tijdschriftTijdschriftartikelAcademicpeer review

TY - JOUR

T1 - Temporal logic control of general Markov decision processes by approximate policy refinement

AU - Haesaert,Sofie

AU - Soudjani,Sadegh

AU - Abate,Alessandro

PY - 2018/1/1

Y1 - 2018/1/1

N2 - The formal verification and controller synthesis for general Markov decision processes (gMDPs) that evolve over uncountable state spaces are computationally hard and thus generally rely on the use of approximate abstractions. In this paper, we contribute to the state of the art of control synthesis for temporal logic properties by computing and quantifying a less conservative gridding of the continuous state space of linear stochastic dynamic systems and by giving a new approach for control synthesis and verification that is robust to the incurred approximation errors. The approximation errors are expressed as both deviations in the outputs of the gMDPs and in the probabilistic transitions.

AB - The formal verification and controller synthesis for general Markov decision processes (gMDPs) that evolve over uncountable state spaces are computationally hard and thus generally rely on the use of approximate abstractions. In this paper, we contribute to the state of the art of control synthesis for temporal logic properties by computing and quantifying a less conservative gridding of the continuous state space of linear stochastic dynamic systems and by giving a new approach for control synthesis and verification that is robust to the incurred approximation errors. The approximation errors are expressed as both deviations in the outputs of the gMDPs and in the probabilistic transitions.

UR - http://www.scopus.com/inward/record.url?scp=85052620838&partnerID=8YFLogxK

U2 - 10.1016/j.ifacol.2018.08.013

DO - 10.1016/j.ifacol.2018.08.013

M3 - Article

VL - 51

SP - 73

EP - 78

JO - IFAC-PapersOnLine

T2 - IFAC-PapersOnLine

JF - IFAC-PapersOnLine

SN - 2405-8963

IS - 16

ER -