This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Subject / Keyword

1Autonomous Robot
1Behavior Policy
1GQ lambda
1Greedy-GQ
1Learning In Parallel
1Mobile Robot

1Off-Policy
1Off-Policy Distance
1Options Learning
1Reinforcement Learning

Show 4 more ...

Item type

1Thesis

Supervisors

1Sutton, Richard (Computing Science)

Author / Creator / Contributor

1Delp, Michael

Year

Collections

1Graduate and Postdoctoral Studies (GPS), Faculty of
1Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations

Languages

1English

Departments

1Department of Computing Science

Experiments in off-policy reinforcement learning with the GQ(lambda) algorithm
Download

Spring 2011

Delp, Michael

Off-policy reinforcement learning is useful in many contexts. Maei, Sutton, Szepesvari, and others, have recently introduced a new class of algorithms, the most advanced of which is GQ(lambda), for off-policy reinforcement learning. These algorithms are the first stable methods for general...

1 - 1 of 1

Search

Items (1)

Collections

Communities

Experiments in off-policy reinforcement learning with the GQ(lambda) algorithm