Search

Filter

Subject / Keyword

Show 2 more ...

Item type

1Report

Author / Creator / Contributor

Year

Collections

Languages

1English

Natural Actor - Critic Algorithms
Download

2009

Bhatnagar, Shalabh, Sutton, Richard, Ghavamzadeh, Mohammad, Lee, Mark

Technical report TR09-10. We present four new reinforcement learning algorithms based on actor-critic, function approximation, and natural gradient ideas, and we provide their convergence proofs. Actor-critic reinforcement learning methods are online approximations to policy iteration in which...

1 - 1 of 1