Nonparametric adaptive control for discrete-time Markov processes with unbounded costs under average criterion

J. Minjárez-Sosa

doi:10.4064/am-26-3-267-280

Instytut Matematyczny Polskiej Akademii Nauk / pl / Wydawnictwa / Czasopisma IMPAN / Applicationes Mathematicae / Wszystkie zeszyty

Przeszukaj wydawnictwa IMPAN

Nonparametric adaptive control for discrete-time Markov processes with unbounded costs under average criterion

Tom 26 / 1999

J. Minjárez-Sosa Applicationes Mathematicae 26 (1999), 267-280 DOI: 10.4064/am-26-3-267-280

Streszczenie

We introduce average cost optimal adaptive policies in a class of discrete-time Markov control processes with Borel state and action spaces, allowing unbounded costs. The processes evolve according to the system equations $x_{t+1}=F(x_t,a_t,ξ _t)$, t=1,2,..., with i.i.d. $ℝ^k$-valued random vectors $ξ_t$, which are observable but whose density ϱ is unknown.

Autorzy

J. Minjárez-Sosa

Pobierz zgodnie z CC-BY

Przeszukaj wydawnictwa IMPAN

Instytut Matematyczny Polskiej Akademii Nauk / pl / Wydawnictwa / Czasopisma IMPAN / Applicationes Mathematicae / Wszystkie zeszyty

Applicationes Mathematicae

Nonparametric adaptive control for discrete-time Markov processes with unbounded costs under average criterion

Tom 26 / 1999

Streszczenie

Autorzy

Przeszukaj wydawnictwa IMPAN

Przepisz kod z obrazka