  • Spring 2013

    Hackman, Leah M

    Gradient-TD methods are a new family of learning algorithms that are stable and convergent under a wider range of conditions than previous reinforcement learning algorithms. In particular, gradient-TD algorithms enable off-policy problems---problems where the distribution of the data is different...
