This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Subject / Keyword

Show 4 more ...

Supervisors

2Sutton, Richard (Computing Science)

Author / Creator / Contributor

Year

Collections

Languages

2English

Item type

2Thesis

Departments

2Department of Computing Science

Experiments in off-policy reinforcement learning with the GQ(lambda) algorithm
Download

Spring 2011

Delp, Michael

Off-policy reinforcement learning is useful in many contexts. Maei, Sutton, Szepesvari, and others, have recently introduced a new class of algorithms, the most advanced of which is GQ(lambda), for off-policy reinforcement learning. These algorithms are the first stable methods for general...
Faster Gradient-TD Algorithms
Download

Spring 2013

Hackman, Leah M

Gradient-TD methods are a new family of learning algorithms that are stable and convergent under a wider range of conditions than previous reinforcement learning algorithms. In particular, gradient-TD algorithms enable off-policy problems---problems where the distribution of the data is different...

1 - 2 of 2

Search

Items (2)

Collections

Communities

Experiments in off-policy reinforcement learning with the GQ(lambda) algorithm

Faster Gradient-TD Algorithms