

Author / Creator / Contributor

Szepesvari, Csaba

Subject / Keyword

Active learning
Actor-critic methods
Artificial Intelligence
Bias-variance tradeoff
Function approximation
Least-squares methods
Machine Learning
Markov decision processes
Monte-Carlo methods
Natural gradient



Collections

Computing Science, Department of
Computing Science, Department of/Technical Reports (Computing Science)

Languages

English

Item type

Report

Reinforcement Learning Algorithms for MDPs

2009

Szepesvari, Csaba

Technical report TR09-13. This article presents a survey of reinforcement learning algorithms for Markov Decision Processes (MDP). In the first half of the article, the problem of value estimation is considered. Here we start by describing the idea of bootstrapping and temporal difference...
