Spring 2015
Much of the focus on finding good representations in reinforcement learning has been on learning complex non-linear predictors of value. Methods like policy gradient, which do not learn a value function and instead represent the policy directly, often need fewer parameters to learn good policies....