2007
Wang, Tao; Schuurmans, Dale; Bowling, Michael; Lizotte, Daniel
Technical report TR07-05. We investigate novel, dual algorithms for dynamic programming and reinforcement learning, based on maintaining explicit representations of stationary distributions instead of value functions. In particular, we investigate the convergence properties of standard dynamic...