Recursive self-tuning control of finite Markov chains

Vivek Borkar

doi:10.4064/am-24-2-169-188

Publishing house / Journals and Serials / Applicationes Mathematicae / All issues

Search for IMPAN publications

Recursive self-tuning control of finite Markov chains

Volume 24 / 1997

Vivek Borkar Applicationes Mathematicae 24 (1997), 169-188 DOI: 10.4064/am-24-2-169-188

Abstract

A recursive self-tuning control scheme for finite Markov chains is proposed wherein the unknown parameter is estimated by a stochastic approximation scheme for maximizing the log-likelihood function and the control is obtained via a relative value iteration algorithm. The analysis uses the asymptotic o.d.e.s associated with these.

Authors

Vivek Borkar

Free download under CC-BY license

Search for IMPAN publications

Publishing house / Journals and Serials / Applicationes Mathematicae / All issues

Applicationes Mathematicae

Recursive self-tuning control of finite Markov chains

Volume 24 / 1997

Abstract

Authors

Search for IMPAN publications

Rewrite code from the image