Search

Filter

Subject / Keyword

Show 4 more ...

Author / Creator / Contributor

Show 4 more ...

Year

Collections

Show 2 more ...

Languages

Item type

Departments

Supervisors

Show 4 more ...

Reinforcement Learning Algorithmic Adaptation to Machine Hardware Faults
Download

Spring 2021

Schoepp, Sheila

On July 20, 1969, the Apollo 11 lunar module, with Astronauts Neil Armstrong and Buzz Aldrin aboard, landed on the moon. It was a great achievement in space exploration. Most people know of this mission's success; yet, there is an untold story about this mission that many people are not aware...
Reinforcement Learning based Controller Design for Nonlinear Process Control
Download

Spring 2020

Shafi, Hareem

Reinforcement learning (RL) has received wide attention in various fields lately. Model-free RL brings data-driven solutions that learn the control strategy directly from interaction with process data without the need for a process model. This is especially beneficial in the case of nonlinear...
Reinforcement Learning on Resource Bounded Systems
Download

Spring 2018

Travnik, Jaden

Recent advancements in reinforcement learning have made the field interesting to academia and industry alike. Many of these advancements depend on deep learning as a means to approximate a value function or a policy. This dependency usually relies on high performance hardware (e.g., a graphics...
Reinforcement Learning-Driven Local Transactive Energy Market for Distributed Energy Resources
Download

Fall 2023

Zhang, Shida

Technological breakthroughs in renewable power generation, battery storage, electric mobility, and advanced data logistics are changing the electric grid. The huge influx of distributed energy resources (DERs), while important to curb carbon emissions, is not without consequences. The highly...
Sample-Efficient Control with Directed Exploration in Discounted MDPs Under Linear Function Approximation
Download

Spring 2022

Kumaraswamy, Raksha K

An important goal of online reinforcement learning algorithms is efficient data collection to learn near-optimal behaviour, that is, optimizing the exploration-exploitation trade-off to reduce the sample-complexity of learning. To improve sample-complexity of learning it is essential that the...
Selective Dyna-style Planning Using Neural Network Models with Limited Capacity
Download

Spring 2020

Zaheer, Muhammad

In model-based reinforcement learning, planning with an imperfect model of the environment has the potential to harm learning progress. But even when a model is imperfect, it may still contain information that is useful for planning. In this thesis, we investigate the idea of using an imperfect...
Sequence Labeling and Transduction with Output-Adjusted Actor-Critic Training of RNNs
Download

Fall 2018

Najafi, Saeed

Neural approaches to sequence labeling often use a Conditional Random Field (CRF) to model their output dependencies, while Recurrent Neural Networks (RNN) are used for the same purpose in other tasks. We set out to establish RNNs as an attractive alternative to CRFs for sequence labeling. To do...
Solving Common-Payoff Games with Approximate Policy Iteration
Download

Fall 2020

Sokota, Samuel

For artificially intelligent learning systems to be deployed widely in real-world settings, it is important that they be able to operate decentrally. Unfortunately, decentralized control is challenging. Even finding approximately optimal joint policies of decentralized partially observable Markov...
Sparse Representation Neural Networks for Online Reinforcement Learning
Download

Fall 2019

Liu, Vincent

In this thesis, we investigate sparse representations in reinforcement learning. We begin by discussing catastrophic interference in reinforcement learning with function approximation, and empirically investigating difficulties of online reinforcement learning in both policy evaluation and...
Stable Dynamic Programming and Reinforcement Learning with Dual Representations
Download

2007

Wang, Tao, Schuurmans, Dale, Bowling, Michael, Lizotte, Daniel

Technical report TR07-05. We investigate novel, dual algorithms for dynamic programming and reinforcement learning, based on maintaining explicit representations of stationary distributions instead of value functions. In particular, we investigate the convergence properties of standard dynamic...