Search

Filter

Subject / Keyword

Show 4 more ...

Author / Creator / Contributor

Show 4 more ...

Year

Collections

Show 2 more ...

Languages

Item type

Departments

Supervisors

Show 4 more ...

Strengths, Weaknesses, and Combinations of Model-based and Model-free Reinforcement Learning
Download

Spring 2016

Asadi Atui, Kavosh

Reinforcement learning algorithms are conventionally divided into two approaches: a model-based approach that builds a model of the environment and then computes a value function from the model, and a model-free approach that directly estimates the value function. The first contribution of this...
Structural Credit Assignment in Neural Networks using Reinforcement Learning
Download

Fall 2021

Gupta, Dhawal

Structural credit assignment in neural networks is a long-standing problem, with a variety of alternatives to backpropagation proposed to allow for local training of nodes. One of the early strategies was to treat each node as an agent and use a reinforcement learning method called REINFORCE to...
Sub-Neural Policies: Option Discovery via Neural Decomposition
Download

Spring 2024

Alikhasi, Mahdi

In reinforcement learning, agents solve problems through interactions with the environment. However, when faced with intricate environmental dynamics, learning can become challenging, resulting in sub-optimal policies. A potential remedy to this situation lies in the transfer of knowledge from...
Targeted Search Control in AlphaZero for Effective Policy Improvement
Download

Spring 2023

Trudeau, Alexandre

AlphaZero is a self-play reinforcement learning algorithm that achieves superhuman play in the games of chess, shogi, and Go via policy iteration. To be an effective policy improvement operator, AlphaZero’s search needs to have accurate value estimates for the states that appear in its search...
The Nature of Decision-Making: Human Behavior vs. Machine Learning
Download

2019-01-01

Beausoleil, Keeya
Toward Practical Reinforcement Learning Algorithms: Classification Based Policy Iteration and Model-Based Learning
Download

Spring 2017

Ávila Pires, Bernardo

In this dissertation, we advance the theoretical understanding of two families of Reinforcement Learning (RL) methods: Classification-based policy iteration (CBPI) and model-based reinforcement learning (MBRL) with factored semi-linear models. In contrast to generalized policy iteration, CBPI...
Two-Timescale Networks for Nonlinear Value Function Approximation
Download

Fall 2019

Chung, Wesley

Policy evaluation, learning value functions, is an integral part of the reinforcement learning problem. In this thesis, I propose a neural network architecture, the Two-Timescale Network (TTN), for value function approximation which utilizes linear function approximation for the value function...
Unifying n-Step Temporal-Difference Action-Value Methods
Download

Spring 2019

Juan Fernando Hernandez Garcia

Unifying seemingly disparate algorithmic ideas to produce better performing algorithms has been a longstanding goal in reinforcement learning. As a primary example, the TD(λ) algorithm elegantly unifies temporal difference (TD) methods with Monte Carlo methods through the use of eligibility...
Useful Policy Invariant Shaping from Arbitrary Advice
Download

Spring 2020

Behboudian, Paniz

Reinforcement learning (RL) is a powerful learning paradigm in which agents can learn to maximize sparse and delayed reward signals. Although RL has had many impressive successes in complex domains, learning can take hours, days, or even years of training data. A major challenge of contemporary...
Using Prior Data to Facilitate Learning and Inference in New Environments
Download

Spring 2021

Wen, Junfeng

This dissertation demonstrates how to utilize data collected previously from different sources to facilitate learning and inference for a target task. Learning from scratch for a target task or environment can be expensive and time-consuming. To address this problem, we make three contributions...