2007
Wang, Tao; Schuurmans, Dale; Bowling, Michael; Lizotte, Daniel
Technical report TR07-05. We investigate novel, dual algorithms for dynamic programming and reinforcement learning, based on maintaining explicit representations of stationary distributions instead of value functions. In particular, we investigate the convergence properties of standard dynamic...