On uniformly nearly-optimal Markov strategies

J. Wal, van der

Research output: Book/ReportReportAcademic

18 Downloads (Pure)

Abstract

In this paper the following result is proved. In any total reward countable state Markov decision process a Markov strategy p exists which is uniformly nearly-optimal in the following sense: v(p) = v* - e (e+u*) . Here v* denotes the value function of the process, u* denotes the value of the process if all negative rewards are neglected, and e is the unit function.
Original languageEnglish
Place of PublicationEindhoven
PublisherTechnische Hogeschool Eindhoven
Number of pages14
Publication statusPublished - 1981

Publication series

NameMemorandum COSOR
Volume8116
ISSN (Print)0926-4493

Fingerprint

Reward
Denote
Markov Decision Process
Value Function
Countable
Unit
Strategy

Cite this

Wal, van der, J. (1981). On uniformly nearly-optimal Markov strategies. (Memorandum COSOR; Vol. 8116). Eindhoven: Technische Hogeschool Eindhoven.
Wal, van der, J. / On uniformly nearly-optimal Markov strategies. Eindhoven : Technische Hogeschool Eindhoven, 1981. 14 p. (Memorandum COSOR).
@book{a5a617fad51e4c6e918afb668d2f0a36,
title = "On uniformly nearly-optimal Markov strategies",
abstract = "In this paper the following result is proved. In any total reward countable state Markov decision process a Markov strategy p exists which is uniformly nearly-optimal in the following sense: v(p) = v* - e (e+u*) . Here v* denotes the value function of the process, u* denotes the value of the process if all negative rewards are neglected, and e is the unit function.",
author = "{Wal, van der}, J.",
year = "1981",
language = "English",
series = "Memorandum COSOR",
publisher = "Technische Hogeschool Eindhoven",

}

Wal, van der, J 1981, On uniformly nearly-optimal Markov strategies. Memorandum COSOR, vol. 8116, Technische Hogeschool Eindhoven, Eindhoven.

On uniformly nearly-optimal Markov strategies. / Wal, van der, J.

Eindhoven : Technische Hogeschool Eindhoven, 1981. 14 p. (Memorandum COSOR; Vol. 8116).

Research output: Book/ReportReportAcademic

TY - BOOK

T1 - On uniformly nearly-optimal Markov strategies

AU - Wal, van der, J.

PY - 1981

Y1 - 1981

N2 - In this paper the following result is proved. In any total reward countable state Markov decision process a Markov strategy p exists which is uniformly nearly-optimal in the following sense: v(p) = v* - e (e+u*) . Here v* denotes the value function of the process, u* denotes the value of the process if all negative rewards are neglected, and e is the unit function.

AB - In this paper the following result is proved. In any total reward countable state Markov decision process a Markov strategy p exists which is uniformly nearly-optimal in the following sense: v(p) = v* - e (e+u*) . Here v* denotes the value function of the process, u* denotes the value of the process if all negative rewards are neglected, and e is the unit function.

M3 - Report

T3 - Memorandum COSOR

BT - On uniformly nearly-optimal Markov strategies

PB - Technische Hogeschool Eindhoven

CY - Eindhoven

ER -

Wal, van der J. On uniformly nearly-optimal Markov strategies. Eindhoven: Technische Hogeschool Eindhoven, 1981. 14 p. (Memorandum COSOR).