
Subject / Keyword

Action-Value Methods
Function Approximation
Reinforcement Learning
Temporal-Difference Learning

Supervisors

Richard Sutton (Computing Science)

Author / Creator / Contributor

Juan Fernando Hernandez Garcia

Collections

Graduate and Postdoctoral Studies (GPS), Faculty of
Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations

Languages

English

Item type

Thesis

Departments

Department of Computing Science

Unifying n-Step Temporal-Difference Action-Value Methods
Spring 2019

Juan Fernando Hernandez Garcia

Unifying seemingly disparate algorithmic ideas to produce better-performing algorithms has been a longstanding goal in reinforcement learning. As a primary example, the TD(λ) algorithm elegantly unifies temporal-difference (TD) methods with Monte Carlo methods through the use of eligibility...
