Bayesian parameter estimation and adaptive control of Markov processes with time-averaged cost

V. Borkar; S. Associate

doi:10.4064/am-25-3-339-358

Wydawnictwa / Czasopisma IMPAN / Applicationes Mathematicae / Wszystkie zeszyty

Przeszukaj wydawnictwa IMPAN

Bayesian parameter estimation and adaptive control of Markov processes with time-averaged cost

Tom 25 / 1998

V. Borkar, S. Associate Applicationes Mathematicae 25 (1998), 339-358 DOI: 10.4064/am-25-3-339-358

Streszczenie

This paper considers Bayesian parameter estimation and an associated adaptive control scheme for controlled Markov chains and diffusions with time-averaged cost. Asymptotic behaviour of the posterior law of the parameter given the observed trajectory is analyzed. This analysis suggests a "cost-biased" estimation scheme and associated self-tuning adaptive control. This is shown to be asymptotically optimal in the almost sure sense.

Autorzy

V. Borkar
S. Associate

Pobierz zgodnie z CC-BY

Przeszukaj wydawnictwa IMPAN

Wydawnictwa / Czasopisma IMPAN / Applicationes Mathematicae / Wszystkie zeszyty

Applicationes Mathematicae

Bayesian parameter estimation and adaptive control of Markov processes with time-averaged cost

Tom 25 / 1998

Streszczenie

Autorzy

Przeszukaj wydawnictwa IMPAN

Przepisz kod z obrazka