Languages: English (2)

Item type: Thesis (2)

Departments: Department of Computing Science (2)

A Unified View of Multi-step Temporal Difference Learning

Fall 2018

Kristopher De Asis

Temporal-difference (TD) learning is an important approach for predictive knowledge representation and sequential decision making. Within TD learning exist multi-step methods that unify one-step TD learning and Monte Carlo methods in such a way that intermediate algorithms can outperform either...

Explorations in the Foundations of Value-based Reinforcement Learning

Fall 2024

De Asis, Kris

Value-based reinforcement learning is an approach to sequential decision making in which decisions are informed by learned, long-horizon predictions of future reward. This dissertation aims to understand issues that value-based methods face and develop algorithmic ideas to address these issues....
