  • 2009

    Szepesvari, Csaba

    Technical report TR09-13. This article presents a survey of reinforcement learning algorithms for Markov Decision Processes (MDPs). In the first half of the article, the problem of value estimation is considered. Here we start by describing the idea of bootstrapping and temporal difference...
