학술논문

An optimality system for finite average Markov decision chains under risk-aversion
Document Type
TEXT
Source
Kybernetika | 2012 Volume:48 | Number:1
Subject
[83]-104
Language
English
Abstract
This work concerns controlled Markov chains with finite state space and compact action sets. The decision maker is risk-averse with constant risk-sensitivity, and the performance of a control policy is measured by the long-run average cost criterion. Under standard continuity-compactness conditions, it is shown that the (possibly non-constant) optimal value function is characterized by a system of optimality equations which allows to obtain an optimal stationary policy. Also, it is shown that the optimal superior and inferior limit average cost functions coincide.