Abstract
Markovian decision processes are considered in the setting of discrete time, countable state space, and general decision space. By introducing a Banach space with a weighted supremum norm, conditions are derived that guarantee convergence of successive approximations to the value function. These conditions are weaker than those required by the usual supremum-norm approach. Several properties of the successive approximations are derived.
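As a rough illustration of the idea in the abstract, the sketch below runs successive approximations on a small finite toy problem and measures the distance between iterates in a weighted supremum norm, taken here as the maximum over states of |v(i)| / mu(i). Everything in it is an assumption made for illustration and is not taken from the memorandum: the function names, the three-state example, the weight function `mu`, and the discount factor `beta`. A finite state space also hides the point of the weighted norm, which matters when rewards are unbounded over a countable state space.

```python
def weighted_sup_norm(v, mu):
    """Weighted supremum norm: max over states of |v(i)| / mu(i), with mu(i) > 0."""
    return max(abs(x) / m for x, m in zip(v, mu))


def bellman_step(v, rewards, transitions, beta):
    """One successive-approximation step: maximize over actions in each state.

    rewards[i][a] is the one-step reward in state i under action a;
    transitions[i][a][j] is the probability of moving from i to j under a.
    """
    return [
        max(
            rewards[i][a]
            + beta * sum(p * v[j] for j, p in enumerate(transitions[i][a]))
            for a in range(len(rewards[i]))
        )
        for i in range(len(v))
    ]


def successive_approximations(rewards, transitions, mu, beta,
                              tol=1e-10, max_iter=10_000):
    """Iterate from v = 0 until consecutive iterates are tol-close in the weighted norm."""
    v = [0.0] * len(rewards)
    for n in range(1, max_iter + 1):
        v_next = bellman_step(v, rewards, transitions, beta)
        gap = weighted_sup_norm([a - b for a, b in zip(v_next, v)], mu)
        v = v_next
        if gap < tol:
            return v, n
    return v, max_iter


# Hypothetical toy data: 3 states, 2 actions per state.
rewards = [[0.0, 1.0], [2.0, 1.0], [3.0, 4.0]]
transitions = [
    [[0.5, 0.5, 0.0], [0.0, 0.5, 0.5]],
    [[0.5, 0.0, 0.5], [0.0, 1.0, 0.0]],
    [[0.0, 0.5, 0.5], [1.0, 0.0, 0.0]],
]
mu = [1.0, 2.0, 3.0]   # assumed positive weight function
beta = 0.9             # assumed discount factor

v, n_iter = successive_approximations(rewards, transitions, mu, beta)
```

On this toy problem the iteration stops once the returned `v` is numerically a fixed point of `bellman_step`. The choice of `mu` only changes how the gap between iterates is measured, not the iterates themselves.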
Original language | English |
---|---|
Place of Publication | Eindhoven |
Publisher | Technische Hogeschool Eindhoven |
Number of pages | 12 |
Publication status | Published - 1974 |
Publication series
Name | Memorandum COSOR |
---|---|
Volume | 7413 |
ISSN (Print) | 0926-4493 |