This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Subject / Keyword

Show 4 more ...

Item type

6Thesis

Author / Creator / Contributor

Year

Collections

Languages

6English

Departments

Supervisors

Show 2 more ...

An Empirical Study of Exploration Strategies for Model-Free Reinforcement Learning
Download

Spring 2020

Yasui, Nikolaus Winget

Reinforcement Learning is a formalism for learning by trial and error. Unfortunately, trial and error can take a long time to find a solution if the agent does not efficiently explore the behaviours available to it. Moreover, how an agent ought to explore depends on the task that the agent is...
Efficient Exploration in Reinforcement Learning through Time-Based Representations
Download

Spring 2019

Cholodovskis Machado, Marlos

In the reinforcement learning (RL) problem an agent must learn how to act optimally through trial-and-error interactions with a complex, unknown, stochastic environment. The actions taken by the agent influence not just the immediate reward it observes but also the future states and rewards it...
Improving Deep Deterministic Policy Gradient for Sparse Reward and Goal-Conditioned Continuous Control
Download

Spring 2024

Futuhi, Ehsan

We propose an improved version of deep deterministic policy gradient (DDPG) for sparse reward and goal-conditioned reinforcement learning. To enhance exploration, we introduce \emph{${\epsilon}{t}$-greedy}, which uses search to generate exploratory options, focusing on less-visited states. We...
Sample-Efficient Control with Directed Exploration in Discounted MDPs Under Linear Function Approximation
Download

Spring 2022

Kumaraswamy, Raksha K

An important goal of online reinforcement learning algorithms is efficient data collection to learn near-optimal behaviour, that is, optimizing the exploration-exploitation trade-off to reduce the sample-complexity of learning. To improve sample-complexity of learning it is essential that the...
Using Visual Communication Design To Optimize Exploration of Large Text-Mining Datasets
Download

Spring 2016

Montague, John J

How can the principles and concepts applied by visual communication designers be used to assist in exploring and understanding the massive, complex volumes of data now available to Digital Humanities researchers? One method we might employ to help us more easily comprehend the implications of...
Value Bonuses Using Ensemble Errors For Exploration in Reinforcement Learning
Download

Spring 2024

Wahab, Abdul

Optimistic value estimates provide one mechanism for directed exploration in reinforcement learning (RL). The agent acts greedily with respect to an estimate of the value plus what can be seen as a value bonus. The value bonus can be learned by estimating a value function on reward bonuses,...

1 - 6 of 6

Search

Items (6)

Collections

Communities

An Empirical Study of Exploration Strategies for Model-Free Reinforcement Learning

Efficient Exploration in Reinforcement Learning through Time-Based Representations

Improving Deep Deterministic Policy Gradient for Sparse Reward and Goal-Conditioned Continuous Control

Sample-Efficient Control with Directed Exploration in Discounted MDPs Under Linear Function Approximation

Using Visual Communication Design To Optimize Exploration of Large Text-Mining Datasets

Value Bonuses Using Ensemble Errors For Exploration in Reinforcement Learning