
A Unified View of Multi-step Temporal Difference Learning

Fall 2018

De Asis, Kristopher

Temporal-difference (TD) learning is an important approach for predictive knowledge representation and sequential decision making. Within TD learning exist multi-step methods which unify one-step TD learning and Monte Carlo methods in a way where intermediate algorithms can outperform either...
Adaptive Decision Making in Dynamic Environments by Artificial and Biological Agents

Fall 2023

Wispinski, Nathan J

The ability to adaptively respond to changing environments is a fundamental aspect of intelligent behaviour. From catching a ball in motion to changing one’s mind in the face of new information, adaptation requires several key cognitive mechanisms, such as the flexible integration of sensorimotor...
Advances in Distributional Reinforcement Learning: Bridging Theory with Algorithmic Practice

Fall 2024

Sun, Ke

This thesis comprehensively investigates Distributional Reinforcement Learning (RL), a vibrant research field that interplays between statistics and RL. As an extension of classical RL, distributional RL, on the one hand, embraces plenty of statistical ideas by incorporating distributional...
Agent-State Construction with Auxiliary Inputs

Fall 2022

Tao, Ruo Yu

In most, if not every, realistic sequential decision-making tasks, the decision-making agent is not able to model the full complexity of the world. In reinforcement learning, the environment is often much larger and more complex than the agent, a setting also known as partial observability. In...
An Empirical Study of Model-Free Exploration for Deep Reinforcement Learning

Fall 2021

Zhao, Xutong

Reinforcement learning (RL) is a learning paradigm focusing on how agents interact with an environment to maximize cumulative reward signals emitted from the environment. The exploration-versus-exploitation challenge is critical in RL research: the agent ought to trade off between taking the known...
Charging Schedule Optimization of Electric Buses Based on Reinforcement Learning

Fall 2021

Chen, Wenzhuo

In recent years, due to the environmental concerns caused by the emissions from public transit services relying on traditional fossil fuels, the electrification of the public transit sector has attracted great attention from both the automobile industry and academia. Specifically, the electric buses...
Dark Hex: A Large Scale Imperfect Information Game

Fall 2022

Tapkan, Mustafa B

Imperfect information games model many large-scale real-world problems. Hex is the classic two-player zero-sum no-draw connection game where each player wants to join their two sides. Dark Hex is an imperfect information version of Hex in which each player sees only their own moves. Finding Nash...
Digital Twin and Smart Automation for Bitumen Extraction Process

Spring 2024

Soesanto, Jansen

The advent of Industry 4.0 integrates advanced digital technologies and Artificial Intelligence (AI) into system engineering. This research explores the potential of AI in smart automation for industries, bridging it with physics-informed approaches, particularly through Explainable Artificial...
Ensembling Diverse Policies Improves Generalization of Deep Reinforcement Learning Algorithms to Environmental Changes in Continuous Control Tasks

Fall 2023

Zhumabekov, Abilmansur

Deep Reinforcement Learning (DRL) algorithms have shown great success in solving continuous control tasks. However, they often struggle to generalize to changes in the environment. Although retraining may help policies adapt to changes, it may be quite costly in some environments. Ensemble...
Estimating Variance of Returns using Temporal Difference Methods

Spring 2021

Bennett, Brendan

Temporal difference (TD) methods provide a powerful means of learning to make predictions in an online, model-free, and highly scalable manner. In the reinforcement learning (RL) framework, we formalize these prediction targets in terms of a (possibly discounted) sum of rewards, called the...