Samenvatting
In this paper, we analyze the asymptotic behaviour of the value function v(t) of an undiscounted continuous-time Markov decision chain. Both the state space and the action space are assumed to be finite. A new proof of the convergence of v(t) - tg is presented (where g denotes the maximal expected average reward over an infinite time horizon). Moreover, it is shown that this convergence is exponential.
Originele taal-2 | Engels |
---|---|
Pagina's (van-tot) | 700-717 |
Tijdschrift | Mathematics of Operations Research |
Volume | 12 |
Nummer van het tijdschrift | 4 |
DOI's | |
Status | Gepubliceerd - 1987 |