On the relative value iteration with a risk-sensitive criterion

Ari Arapostathis; Vivek S. Borkar

doi:10.4064/bc122-1

Volume 122 / 2020

Ari Arapostathis, Vivek S. Borkar
Banach Center Publications 122 (2020), 9-24
MSC: Primary: 90C40, 93E20, 49K40; Secondary: 60J25, 60J60.
DOI: 10.4064/bc122-1

Abstract

A multiplicative relative value iteration algorithm for solving the dynamic programming equation for the risk-sensitive control problem is studied for discrete time controlled Markov chains with a compact Polish state space, and controlled diffusions in the whole Euclidean space. The main result is a proof of convergence to the desired limit in each case.
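For orientation, the following is a minimal sketch of a multiplicative relative value iteration on a finite state and action space. That setting is an assumption made purely for illustration: the paper itself treats compact Polish state spaces and diffusions. The function name `multiplicative_rvi`, the reference state `i0`, and the numbers in the demo are hypothetical and do not come from the paper. Each step applies the multiplicative dynamic programming operator and divides by the current value at `i0`; for a positive transition structure the iterates are expected to settle at a positive vector whose `i0` entry approximates the principal eigenvalue, so its logarithm estimates the risk-sensitive average cost.

```python
import numpy as np


def multiplicative_rvi(P, c, i0=0, tol=1e-12, max_iter=10_000):
    """Sketch of multiplicative relative value iteration (finite case).

    P : array of shape (A, S, S), P[a, i, j] = transition probability i -> j under action a
    c : array of shape (S, A), running cost c[i, a]
    i0: hypothetical reference state used for normalization
    """
    n_states = P.shape[1]
    V = np.ones(n_states)
    for _ in range(max_iter):
        # Q[i, a] = exp(c[i, a]) * sum_j P[a, i, j] * V[j]
        Q = np.exp(c) * (P @ V).T
        # Minimize over actions, then normalize by the value at the reference state.
        V_new = Q.min(axis=1) / V[i0]
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    # Greedy policy with respect to the final iterate.
    policy = (np.exp(c) * (P @ V).T).argmin(axis=1)
    # At a fixed point, V[i0] plays the role of the principal eigenvalue.
    return V, np.log(V[i0]), policy


if __name__ == "__main__":
    # Illustrative two-state, two-action data (made up for this sketch).
    P = np.array([
        [[0.5, 0.5], [0.2, 0.8]],   # action 0
        [[0.9, 0.1], [0.6, 0.4]],   # action 1
    ])
    c = np.array([
        [0.1, 0.4],                 # costs in state 0 for actions 0, 1
        [0.3, 0.2],                 # costs in state 1 for actions 0, 1
    ])
    V, cost, policy = multiplicative_rvi(P, c)
    print("relative value function:", V)
    print("estimated risk-sensitive cost:", cost)
    print("greedy policy:", policy)
```

With a single action the update reduces to a power iteration on the matrix with entries exp(c(i)) P(i, j), normalized by one coordinate, which gives a convenient sanity check against a direct eigenvalue computation.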

Authors

Ari Arapostathis
Department of Electrical and Computer Engineering
The University of Texas at Austin
2501 Speedway, EER 7.824
Austin, TX 78712, USA

Vivek S. Borkar
Department of Electrical Engineering
Indian Institute of Technology
Powai, Mumbai 400076, India

Available for download under the CC-BY license.
