This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Incremental Off-policy Reinforcement Learning Algorithms

Fall 2017

Mahmood, Ashique

Model-free off-policy temporal-difference (TD) algorithms form a powerful component of scalable predictive knowledge representation due to their ability to learn numerous counterfactual predictions in a computationally scalable manner. In this dissertation, we address and overcome two...
Online Off-policy Prediction

Spring 2022

Sina Ghiassian

In this dissertation, we study online off-policy temporal-difference learning algorithms, a class of reinforcement learning algorithms that can learn predictions in an efficient and scalable manner. The contributions of this dissertation are of two kinds: (1) empirically studying existing...
