- 25 White, Martha (Computing Science)
- 3 White, Adam (Computing Science)
- 1 Bowling, Michael (Computing Science)
- 1 Farahmand, Amir-massoud (Computer Science, University of Toronto)
- 1 Fyshe, Alona (Computing Science)
- 1 Greiner, Russell (Computing Science)
- 12 Reinforcement Learning
- 6 Machine Learning
- 3 Neural Networks
- 3 Reinforcement learning
- 2 Dyna
- 2 Exploration
-
Fall 2021
Structural credit assignment in neural networks is a long-standing problem, with a variety of alternatives to backpropagation proposed to allow for local training of nodes. One of the early strategies was to treat each node as an agent and use a reinforcement learning method called REINFORCE to...
-
Towards Practical Offline Reinforcement Learning: Sample Efficient Policy Selection and Evaluation
Spring 2024
Offline reinforcement learning (RL) involves learning policies from datasets, rather than online interaction. The dissertation first investigates a critical component in offline RL: offline policy selection (OPS). Given that most offline RL algorithms require careful hyperparameter tuning, we...
-
Spring 2024
Optimistic value estimates provide one mechanism for directed exploration in reinforcement learning (RL). The agent acts greedily with respect to an estimate of the value plus what can be seen as a value bonus. The value bonus can be learned by estimating a value function on reward bonuses,...
-
Fall 2019
In this thesis, we investigate different vector step-size adaptation approaches for continual, online prediction problems. Vanilla stochastic gradient descent can be considerably improved by scaling the update with a vector of appropriately chosen step-sizes. Many methods, including AdaGrad,...
-
Fall 2023
Oftentimes, machine learning applications using neural networks involve solving discrete optimization problems, such as in pruning, parameter-isolation-based continual learning and training of binary networks. Still, these discrete problems are combinatorial in nature and are also not amenable to...