Usman Ahmed Syed
Usman Ahmed Syed
Home
Experience
Projects
Publications
CV
Contact
Light
Dark
Automatic
TD learning
Exact behavior of TD learning algorithms
Use of Markov Jump Linear Systems theory for the finite time analysis of Temporal Difference learning algorithms.
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory
In this paper, we provide a unified analysis of temporal difference learning algorithms with linear function approximators by …
Bin Hu
,
Usman Ahmed Syed
PDF
Cite
Project
Cite
×