Search

Filter

Collections

Supervisors

Show 3 more ...

Author / Creator / Contributor

Show 4 more ...

Subject / Keyword

Show 4 more ...

Year

Languages

14English

Item type

14Thesis

Departments

14Department of Computing Science

Agent-State Construction with Auxiliary Inputs
Download

Fall 2022

Tao, Ruo Yu

In most, if not every, realistic sequential decision-making tasks, the decision-making agent is not able to model the full complexity of the world. In reinforcement learning, the environment is often much larger and more complex than the agent, a setting also known as partial observability. In...
An Empirical Study of Experience Replay for Control in Continuous State Domains
Download

Fall 2022

Li, Xin

In this thesis, we investigate the empirical performance of several experience replay techniques. Efficient experience replay plays an important role in model-free reinforcement learning by improving sample efficiency through reusing past experience. However, replay-based methods were largely...
An Empirical Study of Exploration Strategies for Model-Free Reinforcement Learning
Download

Spring 2020

Yasui, Nikolaus Winget

Reinforcement Learning is a formalism for learning by trial and error. Unfortunately, trial and error can take a long time to find a solution if the agent does not efficiently explore the behaviours available to it. Moreover, how an agent ought to explore depends on the task that the agent is...
An Empirical Study of Model-Free Exploration for Deep Reinforcement Learning
Download

Fall 2021

Zhao, Xutong

Reinforcement learning (RL) is a learning paradigm focusing on how agents interact with an environment to maximize cumulative reward signals emitted from the environment. Exploration versus exploitation challenge is critical in RL research: the agent ought to trade off between taking the known...
Characterizing Discrete Representations for Reinforcement Learning
Download

Fall 2023

Meyer, Edan J

In reinforcement learning (RL), agents learn to maximize a reward signal using nothing but observations from the environment as input to their decision making processes. Whether the agent is simple, consisting of only a policy that maps observations to actions, or complex, containing auxiliary...
Continual Auxiliary Task Learning
Download

Fall 2021

McLeod, Matthew

Learning auxiliary tasks, such as multiple predictions about the world, can provide many benets to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there is little work on how to adapt the behavior to gather...
Entropy as a Measure of Puzzle Difficulty
Download

Fall 2023

Chen, You Chen Eugene

Evaluating and ranking the difficulty and enjoyment of puzzles is important in game design. Typically, such rankings are constructed manually for each specific game, which can be time consuming, subject to designer bias, and requires extensive play testing. An approach to ranking that generalizes...
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement
Download

Fall 2022

Neumann,Samuel

Actor-Critics are a popular class of algorithms for control. Their ability to learn complex behaviours in continuous-action environments make them directly applicable to many real-world scenarios. These algorithms are composed of two parts - a critic and an actor. The critic learns to critique...
Improving Water Treatment Using Reinforcement Learning
Download

Fall 2022

Liu, Puer

We have witnessed the rising popularity of real-world applications of reinforcement learning (RL). However, most successful real-world applications of RL rely on high-fidelity simulators that enable rapid iteration of prototypes, hyperparameter selection and policy training. On the other hand, RL...
Leveraging Off-Policy Prediction in Recurrent Networks for Reinforcement Learning
Download

Fall 2023

Schlegel, Matthew K

Partial observability---when the senses lack enough detail to make an optimal decision---is the reality of any decision making agent acting in the real world. While an agent could be made to make due with its available senses, taking advantage of the history of senses can provide more context and...