This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Subject / Keyword

Show 4 more ...

Departments

Collections

Author / Creator / Contributor

Show 4 more ...

Year

Languages

23English

Item type

23Thesis

Supervisors

Show 4 more ...

A Unified View of Multi-step Temporal Difference Learning
Download

Fall 2018

Kristopher De Asis

Temporal-difference (TD) learning is an important approach for predictive knowledge representation and sequential decision making. Within TD learning exists multi-step methods which unify one-step TD learning and Monte Carlo methods in a way where intermediate algorithms can outperform either...
Advances in Distributional Reinforcement Learning: Bridging Theory with Algorithmic Practice
Download

Fall 2024

Sun, Ke

This thesis comprehensively investigates Distributional Reinforcement Learning~(RL), a vibrant research field that interplays between statistics and RL. As an extension of classical RL, distributional RL, on the one hand, embraces plenty of statistical ideas by incorporating distributional...
Agent-State Construction with Auxiliary Inputs
Download

Fall 2022

Tao, Ruo Yu

In most, if not every, realistic sequential decision-making tasks, the decision-making agent is not able to model the full complexity of the world. In reinforcement learning, the environment is often much larger and more complex than the agent, a setting also known as partial observability. In...
An Empirical Study of Model-Free Exploration for Deep Reinforcement Learning
Download

Fall 2021

Zhao, Xutong

Reinforcement learning (RL) is a learning paradigm focusing on how agents interact with an environment to maximize cumulative reward signals emitted from the environment. Exploration versus exploitation challenge is critical in RL research: the agent ought to trade off between taking the known...
Dark Hex: A Large Scale Imperfect Information Game
Download

Fall 2022

Tapkan, Mustafa B

Imperfect information games model many large-scale real-world problems. Hex is the classic two-player zero-sum no-draw connection game where each player wants to join their two sides. Dark Hex is an imperfect information version of Hex in which each player sees only their own moves. Finding Nash...
Ensembling Diverse Policies Improves Generalization of Deep Reinforcement Learning Algorithms to Environmental Changes in Continuous Control Tasks
Download

Fall 2023

Zhumabekov, Abilmansur

Deep Reinforcement Learning (DRL) algorithms have shown great success in solving continuous control tasks. However, they often struggle to generalize to changes in the environment. Although retraining may help policies adapt to changes, it may be quite costly in some environments. Ensemble...
Estimating Variance of Returns using Temporal Difference Methods
Download

Spring 2021

Bennett, Brendan

Temporal difference (TD) methods provide a powerful means of learning to make predictions in an online, model-free, and highly scalable manner. In the reinforcement learning (RL) framework, we formalize these prediction targets in terms of a (possibly discounted) sum of rewards, called the...
Evaluating Search Spaces for Programmatic Policies in POMDPs
Download

Spring 2024

Carvalho, Tales Henrique

Searching for programmatic policies to solve a reinforcement learning problem can be challenging, particularly when dealing with domain-specific languages (DSLs) that define policies with internal states for partially observable Markov decision processes (POMDPs). This is because they lead to...
Examining Bio-Inspired Approaches for Continual Reinforcement Learning
Download

Fall 2024

Mastikhina, Olya

Despite the brain's inherent ability to continually learn, biological insights are rarely applied to continual reinforcement learning (RL). This thesis addresses this gap by examining four under-investigated biologically-inspired modifications within the context of continual RL: energy...
Explorations in the Foundations of Value-based Reinforcement Learning
Download

Fall 2024

De Asis, Kris

Value-based reinforcement learning is an approach to sequential decision making in which decisions are informed by learned, long-horizon predictions of future reward. This dissertation aims to understand issues that value-based methods face and develop algorithmic ideas to address these issues....