Search

Filter

Departments

33Department of Computing Science

Languages

33English

Supervisors

Show 4 more ...

Author / Creator / Contributor

Show 4 more ...

Subject / Keyword

Show 4 more ...

Year

Collections

Item type

33Thesis

A Distribution Dependent Analysis of Meta-Learning
Download

Spring 2022

Konobeev, Mikhail

A key problem in the theory of meta-learning is to understand how the task distributions influence transfer risk, the expected error of a meta-learner on a new task drawn from the unknown task distribution. In this work, focusing on fixed design linear regression with Gaussian noise and a...
Agent-State Construction with Auxiliary Inputs
Download

Fall 2022

Tao, Ruo Yu

In most, if not every, realistic sequential decision-making tasks, the decision-making agent is not able to model the full complexity of the world. In reinforcement learning, the environment is often much larger and more complex than the agent, a setting also known as partial observability. In...
An Empirical Study of Experience Replay for Control in Continuous State Domains
Download

Fall 2022

Li, Xin

In this thesis, we investigate the empirical performance of several experience replay techniques. Efficient experience replay plays an important role in model-free reinforcement learning by improving sample efficiency through reusing past experience. However, replay-based methods were largely...
An Empirical Study of Exploration Strategies for Model-Free Reinforcement Learning
Download

Spring 2020

Yasui, Nikolaus Winget

Reinforcement Learning is a formalism for learning by trial and error. Unfortunately, trial and error can take a long time to find a solution if the agent does not efficiently explore the behaviours available to it. Moreover, how an agent ought to explore depends on the task that the agent is...
An Empirical Study of Model-Free Exploration for Deep Reinforcement Learning
Download

Fall 2021

Zhao, Xutong

Reinforcement learning (RL) is a learning paradigm focusing on how agents interact with an environment to maximize cumulative reward signals emitted from the environment. Exploration versus exploitation challenge is critical in RL research: the agent ought to trade off between taking the known...
Analysis of an Alternate Policy Gradient Estimator for Softmax Policies
Download

Spring 2022

Garg, Shivam

Policy gradient (PG) estimators are ineffective in dealing with softmax policies that are sub-optimally saturated, which refers to the situation when the policy concentrates its probability mass on sub-optimal actions. Sub-optimal policy saturation may arise from a bad policy initialization or a...
Characterizing Discrete Representations for Reinforcement Learning
Download

Fall 2023

Meyer, Edan J

In reinforcement learning (RL), agents learn to maximize a reward signal using nothing but observations from the environment as input to their decision making processes. Whether the agent is simple, consisting of only a policy that maps observations to actions, or complex, containing auxiliary...
Chasing Hallucinated Value: A Pitfall of Dyna Style Algorithms with Imperfect Environment Models
Download

Spring 2020

Jafferjee, Taher

In Dyna style algorithms, reinforcement learning (RL) agents use a model of the environment to generate simulated experience. By updating on this simulated experience, Dyna style algorithms allow agents to potentially learn control policies in fewer environment interactions than agents that use...
Continual Auxiliary Task Learning
Download

Fall 2021

McLeod, Matthew

Learning auxiliary tasks, such as multiple predictions about the world, can provide many benets to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there is little work on how to adapt the behavior to gather...
Continuous Multilevel Actions in Reinforcement Learning
Download

Fall 2023

Mitchell, Daniel

Multilevel action selection is a reinforcement learning technique in which an action is broken into two parts, the type and the parameters. When using multilevel action selection in reinforcement learning, one must break the action space into multiple subsets. These subsets are typically disjoint...