- Spring 2023
Gradient descent algorithms suffer from many problems when learning representations with fixed neural network architectures, such as reduced plasticity on non-stationary continual tasks and difficulty training sparse architectures from scratch. A common workaround is continuously adapting the neural...
- Fall 2021
The representations generated by many models of language (word embeddings, recurrent neural networks, and transformers) correlate with brain activity recorded while people listen. However, these decoding results are usually based on the brain’s reaction to syntactically and semantically sound...
- Fall 2021
A common scientific challenge in putting a reinforcement learning agent into practice is improving sample efficiency as much as possible under limited computational or memory resources. The available physical resources vary across applications. My thesis introduces some approaches...
- Improving the reliability of reinforcement learning algorithms through biconjugate Bellman errors
Spring 2024
In this thesis, we seek to improve the reliability of reinforcement learning algorithms for nonlinear function approximation. Semi-gradient temporal difference (TD) update rules form the basis of most state-of-the-art value function learning systems despite clear counterexamples proving their...
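The semi-gradient TD update rules this abstract refers to can be sketched for the linear case as follows. This is a generic textbook illustration, not the thesis's method; the linear features, step size, and discount factor are assumptions for the example.

```python
import numpy as np

def semi_gradient_td0(w, phi_s, phi_s_next, reward, gamma=0.99, alpha=0.1):
    """One semi-gradient TD(0) update for a linear value estimate v(s) = w . phi(s).

    "Semi-gradient" means the bootstrapped target r + gamma * v(s') is treated
    as a constant: no gradient is taken through phi_s_next, which is the source
    of the known divergence counterexamples.
    """
    td_error = reward + gamma * np.dot(w, phi_s_next) - np.dot(w, phi_s)
    return w + alpha * td_error * phi_s
```

With one-hot features and zero initial weights, a reward of 1 moves only the weight of the visited state by `alpha * td_error`.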
- Spring 2020
Mapping the macrostructural connectivity of the living human brain is one of the primary goals of neuroscientists who study connectomics. The reconstruction of a brain's structural connectivity, aka its connectome, typically involves applying expert analysis to diffusion-weighted magnetic...
- Fall 2023
Partial observability---when the senses lack enough detail to make an optimal decision---is the reality of any decision-making agent acting in the real world. While an agent could be made to make do with its available senses, taking advantage of the history of senses can provide more context and...
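Using a history of senses as the agent's state can be sketched with a simple sliding window. This is a generic illustration of the idea, not the thesis's actual construction; the window size `k` and the tuple encoding are assumptions.

```python
from collections import deque

class HistoryState:
    """Builds the agent's state from the last k observations.

    A window of recent senses adds context a single observation lacks under
    partial observability.
    """

    def __init__(self, k=4):
        self.buffer = deque(maxlen=k)  # oldest observations drop out automatically

    def update(self, obs):
        self.buffer.append(obs)
        return tuple(self.buffer)  # hashable, so it can index a value table
```

More sophisticated agents replace the fixed window with a learned summary (e.g. a recurrent network), but the fixed window is the simplest instance of the same idea.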
- Fall 2023
We study the use of reinforcement-learning based prediction approaches for a real drinking-water treatment plant. Developing such a prediction system is a critical step on the path to optimizing and automating water treatment. Before that, there are many questions to answer about predictability...
- Spring 2020
In model-based reinforcement learning, planning with an imperfect model of the environment has the potential to harm learning progress. But even when a model is imperfect, it may still contain information that is useful for planning. In this thesis, we investigate the idea of using an imperfect...
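Planning with a learned (and possibly imperfect) model is commonly done in the Dyna style, which this abstract's setting builds on. The sketch below is a minimal tabular Dyna-Q step for illustration; the two-action space, step size, discount, and planning budget are all assumptions, not the thesis's configuration.

```python
import random

def dyna_q_step(q, model, s, a, r, s_next, actions=(0, 1),
                n_planning=5, alpha=0.1, gamma=0.95):
    """One tabular Dyna-Q step: a direct Q-learning update from real experience,
    model learning, then extra updates from transitions replayed by the model.
    If the model is imperfect, the replayed transitions can help or harm."""
    def q_update(state, action, reward, next_state):
        best_next = max(q.get((next_state, b), 0.0) for b in actions)
        q[(state, action)] = q.get((state, action), 0.0) + alpha * (
            reward + gamma * best_next - q.get((state, action), 0.0))

    q_update(s, a, r, s_next)          # direct RL update from real experience
    model[(s, a)] = (r, s_next)        # model learning: remember the last outcome
    for _ in range(n_planning):        # planning with simulated experience
        (ps, pa), (pr, ps_next) = random.choice(list(model.items()))
        q_update(ps, pa, pr, ps_next)
    return q, model
```

Each planning step repeats the same update rule on model-generated experience, so a single real transition can be amortized into several value updates.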
- Fall 2020
For artificially intelligent learning systems to be deployed widely in real-world settings, it is important that they be able to operate in a decentralized manner. Unfortunately, decentralized control is challenging. Even finding approximately optimal joint policies of decentralized partially observable Markov...
- Strange springs in many dimensions: how parametric resonance can explain divergence under covariate shift.
Fall 2021
Most convergence guarantees for stochastic gradient descent with momentum (SGDm) rely on independently and identically distributed (iid) data sampling. Yet, SGDm is often used outside this regime, in settings with temporally correlated inputs such as continual learning and reinforcement learning....
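The SGDm update the abstract refers to can be written as the heavy-ball recursion below. This is the standard form, not code from the thesis; the learning rate and momentum parameter are placeholders.

```python
def sgdm_step(w, v, grad, lr=0.01, beta=0.9):
    """One SGD-with-momentum (heavy-ball) step.

    The momentum buffer v is an exponentially weighted sum of past gradients;
    when inputs are temporally correlated rather than iid, this accumulation
    can interact with the input dynamics instead of averaging them out.
    """
    v = beta * v + grad     # accumulate the gradient into the momentum buffer
    w = w - lr * v          # descend along the accumulated direction
    return w, v
```

The thesis title's spring analogy comes from viewing this recursion as a damped oscillator whose parameters are driven by the (time-varying) input distribution.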