Search
- 37 reinforcement learning
- 7 machine learning
- 3 artificial intelligence
- 3 deep learning
- 3 optimization
- 3 planning
- 2 Ady, Nadia M.
- 2 Pilarski, Patrick M.
- 1 Bennett, Brendan
- 1 Carvalho, Tales Henrique
- 1 Chakravarty, Sucheta
- 1 Chan, Alan
- Spring 2021
Temporal difference (TD) methods provide a powerful means of learning to make predictions in an online, model-free, and highly scalable manner. In the reinforcement learning (RL) framework, we formalize these prediction targets in terms of a (possibly discounted) sum of rewards, called the...
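A minimal sketch of the idea (an illustration, not code from the thesis): tabular TD(0) prediction on a toy five-state random walk, where the prediction target is the return, the (possibly discounted) sum of future rewards, and learning proceeds online from single transitions.

    import random

    GAMMA = 1.0       # discount factor (undiscounted episodic task)
    ALPHA = 0.1       # step size
    N_STATES = 5      # non-terminal states 0..4; episodes start in the middle

    def run_episode(v):
        s = N_STATES // 2
        while True:
            s_next = s + random.choice([-1, 1])      # unbiased random walk
            if s_next < 0:                           # left terminal, reward 0
                r, v_next, done = 0.0, 0.0, True
            elif s_next >= N_STATES:                 # right terminal, reward 1
                r, v_next, done = 1.0, 0.0, True
            else:
                r, v_next, done = 0.0, v[s_next], False
            # TD(0): move V(s) toward the one-step bootstrapped estimate of the return
            v[s] += ALPHA * (r + GAMMA * v_next - v[s])
            if done:
                return
            s = s_next

    values = [0.5] * N_STATES
    for _ in range(1000):
        run_episode(values)
    print([round(x, 2) for x in values])  # approaches [0.17, 0.33, 0.5, 0.67, 0.83]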
- Spring 2024
Searching for programmatic policies to solve a reinforcement learning problem can be challenging, particularly when dealing with domain-specific languages (DSLs) that define policies with internal states for partially observable Markov decision processes (POMDPs). This is because they lead to...
- Fall 2024
Value-based reinforcement learning is an approach to sequential decision making in which decisions are informed by learned, long-horizon predictions of future reward. This dissertation aims to understand issues that value-based methods face and develop algorithmic ideas to address these issues....
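A minimal sketch of value-based decision making (an illustration on a hypothetical one-dimensional corridor, not the dissertation's algorithm): tabular Q-learning maintains long-horizon action-value estimates and selects actions greedily with respect to them.

    import random

    N, GOAL = 6, 5                 # corridor states 0..5; the goal is the right end
    ACTIONS = [-1, +1]             # move left / move right
    GAMMA, ALPHA, EPS = 0.95, 0.1, 0.1

    Q = [[0.0, 0.0] for _ in range(N)]   # long-horizon action-value estimates

    def greedy(qs):
        best = max(qs)
        return random.choice([i for i, q in enumerate(qs) if q == best])

    for _ in range(500):                               # episodes
        s = 0
        for _ in range(100):                           # step limit per episode
            a = random.randrange(2) if random.random() < EPS else greedy(Q[s])
            s_next = min(max(s + ACTIONS[a], 0), N - 1)
            r = 1.0 if s_next == GOAL else 0.0
            bootstrap = 0.0 if s_next == GOAL else max(Q[s_next])
            Q[s][a] += ALPHA * (r + GAMMA * bootstrap - Q[s][a])   # value-based update
            if s_next == GOAL:
                break
            s = s_next

    # Decisions informed by the learned values: move right (action index 1) in every state
    print([greedy(q) for q in Q[:GOAL]])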
- Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
Fall 2020
Policy gradient methods typically estimate both explicit policy and value functions. The long-extant view of policy gradient methods as approximate policy iteration---alternating between policy evaluation and policy improvement by greedification---is a helpful framework to elucidate algorithmic...
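A minimal sketch of greedification viewed as KL minimization (my illustration, not the thesis code): for a single state, the Boltzmann distribution over the action values can serve as the greedification target, and the forward KL(B‖π) and reverse KL(π‖B) define different improvement objectives (mass-covering versus mode-seeking).

    import math

    def softmax(xs, tau=1.0):
        m = max(x / tau for x in xs)
        exps = [math.exp(x / tau - m) for x in xs]
        z = sum(exps)
        return [e / z for e in exps]

    def kl(p, q):
        # KL(p || q) over a finite action set
        return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

    q_values = [1.0, 0.9, -2.0]             # action values for one state (toy numbers)
    boltzmann = softmax(q_values, tau=0.1)  # greedification target B
    policy = [1/3, 1/3, 1/3]                # current (uniform) policy pi

    print("forward KL(B || pi):", kl(boltzmann, policy))
    print("reverse KL(pi || B):", kl(policy, boltzmann))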
- Fall 2022
Actor-Critics are a popular class of algorithms for control. Their ability to learn complex behaviours in continuous-action environments makes them directly applicable to many real-world scenarios. These algorithms are composed of two parts: a critic and an actor. The critic learns to critique...
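A minimal sketch of this two-part structure (an illustration on a hypothetical two-armed bandit, not the thesis code): a softmax actor adjusts its action preferences using the critic's error signal, while the critic learns a value baseline.

    import math, random

    theta = [0.0, 0.0]     # actor parameters (action preferences)
    v = 0.0                # critic: estimated value of the single state
    ALPHA_ACTOR, ALPHA_CRITIC = 0.05, 0.1

    def softmax(prefs):
        exps = [math.exp(p - max(prefs)) for p in prefs]
        z = sum(exps)
        return [e / z for e in exps]

    for _ in range(5000):
        probs = softmax(theta)
        a = 0 if random.random() < probs[0] else 1
        # hypothetical payoffs: action 0 succeeds 30% of the time, action 1 80%
        r = 1.0 if random.random() < (0.3 if a == 0 else 0.8) else 0.0
        delta = r - v                      # critic's error signal critiques the actor
        v += ALPHA_CRITIC * delta          # critic update
        for i in range(2):                 # actor update: policy-gradient step
            grad = (1.0 if i == a else 0.0) - probs[i]
            theta[i] += ALPHA_ACTOR * delta * grad

    print(softmax(theta))   # probability mass concentrates on the better action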
- Spring 2023
Construction labour productivity (CLP) is a key performance indicator for determining the success of construction undertakings, and it notably affects the profitability of construction companies. Accordingly, the construction industry and researchers have pursued better ways of addressing the CLP...
- Spring 2024
Recent strides in lower-limb exoskeleton development have significantly enhanced the potential for more effective rehabilitation and assistance for individuals with mobility impairments. Despite these advancements, the widespread adoption of exoskeletons demands improvements in both hardware and...
- Spring 2022
The concept of state is fundamental to a reinforcement learning agent. The state is the input to the agent's action-selection policy, value functions, and environmental model. A reinforcement learning agent interacts with the environment by performing actions and receiving observations, resulting...
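A minimal sketch of this interaction loop (hypothetical interfaces, not the thesis code): here the agent's state is simply a recency-weighted trace of its observations, and that constructed state is the input to its action-selection policy.

    import random

    def environment_step(action):
        """Hypothetical environment: returns a noisy scalar observation."""
        return action + random.gauss(0.0, 0.1)

    def policy(state):
        """Hypothetical policy: selects an action based on the constructed state."""
        return 1 if state < 0.5 else 0

    state = 0.0                      # agent state: exponential trace of observations
    DECAY = 0.9
    for t in range(10):
        action = policy(state)                               # state -> action
        observation = environment_step(action)               # action -> observation
        state = DECAY * state + (1 - DECAY) * observation    # observation -> new state
        print(t, action, round(observation, 2), round(state, 2))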
- Spring 2024
In this dissertation, I investigate how we can exploit generic problem structure to make reinforcement learning algorithms more efficient. Generic problem structure means basic structure that exists in a wide range of problems (e.g., an action taken in the present does not influence the past), as...