Spring 2013
Gradient-TD methods are a new family of learning algorithms that are stable and convergent under a wider range of conditions than previous reinforcement learning algorithms. In particular, gradient-TD algorithms enable off-policy problems---problems where the distribution of the data is different...