Search
Skip to Search Results
Filter
Subject / Keyword
- 3Reinforcement learning
- 2Reinforcement Learning
- 1Active learning
- 1Actor-critic methods
- 1Artificial Intelligence
- 1Bias-variance tradeoff
Item type
Author / Creator / Contributor
Year
Collections
Languages
-
2007
Wang, Tao, Schuurmans, Dale, Bowling, Michael, Lizotte, Daniel
Technical report TR07-05. We investigate novel, dual algorithms for dynamic programming and reinforcement learning, based on maintaining explicit representations of stationary distributions instead of value functions. In particular, we investigate the convergence properties of standard dynamic...
1 - 3 of 3