On adaptive control of a partially observed Markov chain

Giovanni Di Masi; Łukasz Stettner

doi:10.4064/am-22-2-165-180

Instytut Matematyczny Polskiej Akademii Nauk / pl / Wydawnictwa / Czasopisma IMPAN / Applicationes Mathematicae / Wszystkie zeszyty

Przeszukaj wydawnictwa IMPAN

On adaptive control of a partially observed Markov chain

Tom 22 / 1994

Giovanni Di Masi, Łukasz Stettner Applicationes Mathematicae 22 (1994), 165-180 DOI: 10.4064/am-22-2-165-180

Streszczenie

A control problem for a partially observable Markov chain depending on a parameter with long run average cost is studied. Using uniform ergodicity arguments it is shown that, for values of the parameter varying in a compact set, it is possible to consider only a finite number of nearly optimal controls based on the values of actually computable approximate filters. This leads to an algorithm that guarantees nearly selfoptimizing properties without identifiability conditions. The algorithm is based on probing control, whose cost is additionally assumed to be periodically observable.

Autorzy

Giovanni Di Masi
Łukasz Stettner

Pobierz zgodnie z CC-BY

Przeszukaj wydawnictwa IMPAN

Instytut Matematyczny Polskiej Akademii Nauk / pl / Wydawnictwa / Czasopisma IMPAN / Applicationes Mathematicae / Wszystkie zeszyty

Applicationes Mathematicae

On adaptive control of a partially observed Markov chain

Tom 22 / 1994

Streszczenie

Autorzy

Przeszukaj wydawnictwa IMPAN

Przepisz kod z obrazka