This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Author / Creator / Contributor

1Bhatnagar, Shalabh
1Ghavamzadeh, Mohammad
1Lee, Mark
1Sutton, Richard

Subject / Keyword

1Actor-critic reinforcement learning algorithms
1Approximate dynamic programming
1Bootstrapping
1Function approximation
1Natural-gradient
1Policy gradient methods

1Temporal difference learning
1Two-timescale stochastic approximation

Show 2 more ...

Year

Collections

1Computing Science, Department of
1Computing Science, Department of/Technical Reports (Computing Science)

Languages

1English

Item type

1Report

Natural Actor - Critic Algorithms
Download

2009

Bhatnagar, Shalabh, Sutton, Richard, Ghavamzadeh, Mohammad, Lee, Mark

Technical report TR09-10. We present four new reinforcement learning algorithms based on actor-critic, function approximation, and natural gradient ideas, and we provide their convergence proofs. Actor-critic reinforcement learning methods are online approximations to policy iteration in which...

1 - 1 of 1

Search

Items (1)

Collections

Communities

Natural Actor - Critic Algorithms