Search Results

Supervisors:
- White, Martha (Computing Science): 25
- Bowling, Michael (Computing Science): 21
- Schuurmans, Dale (Computing Science): 3
- White, Adam (Computing Science): 3
- Bellemare, Marc (Google Brain): 1
- Farahmand, Amir-massoud (Computer Science, University of Toronto): 1

Subjects:
- Reinforcement Learning: 17
- Machine Learning: 10
- Artificial Intelligence: 7
- Machine learning: 4
- Reinforcement learning: 4
- Exploration: 3
- Spring 2016: Game theoretic solution concepts, such as Nash equilibrium strategies that are optimal against worst-case opponents, provide guidance in finding desirable autonomous agent behaviour. In particular, we wish to approximate solutions to complex, dynamic tasks, such as negotiation or bidding in...
- Fall 2015: Extensive-form games are a powerful framework for modeling sequential multi-agent interactions. In extensive-form games with imperfect information, Nash equilibria are generally used as a solution concept, but computing a Nash equilibrium can be intractable in large games. Instead, a variety of...
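The intractability point in this snippet is why equilibria are usually approximated by iterative self-play rather than solved directly. A minimal sketch of that idea (my illustration, not code from the thesis): regret matching on rock-paper-scissors, a tiny zero-sum matrix game, where the time-averaged strategies approach a Nash equilibrium.

```python
import numpy as np

# Rock-paper-scissors payoff matrix for the row player (zero-sum game).
A = np.array([[ 0.0, -1.0,  1.0],
              [ 1.0,  0.0, -1.0],
              [-1.0,  1.0,  0.0]])

def regret_matching(A, iters=50000):
    """Approximate a Nash equilibrium of a zero-sum matrix game by
    self-play regret matching; returns the average strategies."""
    n_r, n_c = A.shape
    # Small asymmetric initial regret so self-play does not start
    # exactly at the fixed point (uniform vs. uniform has zero regret).
    reg_r = np.zeros(n_r); reg_r[0] = 1.0
    reg_c = np.zeros(n_c)
    avg_r = np.zeros(n_r); avg_c = np.zeros(n_c)
    for _ in range(iters):
        # Play proportionally to positive regret (uniform if none).
        p = np.maximum(reg_r, 0.0)
        p = p / p.sum() if p.sum() > 0 else np.full(n_r, 1.0 / n_r)
        q = np.maximum(reg_c, 0.0)
        q = q / q.sum() if q.sum() > 0 else np.full(n_c, 1.0 / n_c)
        # Accumulate each pure action's regret against the opponent's mix.
        u_r = A @ q            # row player's payoff per pure action
        u_c = -(p @ A)         # column player's payoff per pure action
        reg_r += u_r - p @ u_r
        reg_c += u_c - q @ u_c
        avg_r += p; avg_c += q
    return avg_r / iters, avg_c / iters

p_bar, q_bar = regret_matching(A)
# For rock-paper-scissors the average strategies approach uniform (1/3 each).
```

The per-iteration regret bound translates into low exploitability of the average strategy profile, which is the same principle behind counterfactual regret minimization in large extensive-form games.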
- Spring 2024: Optimistic value estimates provide one mechanism for directed exploration in reinforcement learning (RL). The agent acts greedily with respect to an estimate of the value plus what can be seen as a value bonus. The value bonus can be learned by estimating a value function on reward bonuses,...
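The mechanism this snippet describes, acting greedily with respect to a value estimate plus a bonus, is easy to show in a toy bandit. A rough sketch (my illustration, not the thesis's method): a count-based bonus c/sqrt(n) added to the incremental value estimate, with the arm means and all constants assumed for the example.

```python
import numpy as np

def optimistic_bandit(means, steps=3000, c=2.0, seed=0):
    """Greedy action selection w.r.t. value estimate plus a count-based
    bonus: a simple instance of optimism-driven directed exploration."""
    rng = np.random.default_rng(seed)
    k = len(means)
    q = np.zeros(k)   # incremental value estimates
    n = np.zeros(k)   # visit counts per arm
    for _ in range(steps):
        # Bonus is huge for unvisited arms, so every arm gets tried.
        bonus = c / np.sqrt(np.maximum(n, 1e-8))
        a = int(np.argmax(q + bonus))      # greedy w.r.t. value + bonus
        r = rng.normal(means[a], 1.0)      # noisy reward
        n[a] += 1
        q[a] += (r - q[a]) / n[a]          # incremental sample mean
    return q, n

q, n = optimistic_bandit([0.1, 0.3, 0.9])
# The best arm (index 2) should end up with the most pulls.
```

As the bonus shrinks with visits, pulls concentrate on the genuinely best arm, while under-explored arms keep an inflated value that draws the greedy policy back to them.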
- Fall 2019: In this thesis, we investigate different vector step-size adaptation approaches for continual, online prediction problems. Vanilla stochastic gradient descent can be considerably improved by scaling the update with a vector of appropriately chosen step-sizes. Many methods, including AdaGrad,...
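The vector step-size idea in this snippet can be sketched with AdaGrad-style per-coordinate scaling, which the snippet names, on a badly conditioned quadratic (an illustration under assumed curvatures, not code from the thesis).

```python
import numpy as np

def adagrad_quadratic(curv, steps=500, lr=1.0, eps=1e-8):
    """Minimize f(w) = 0.5 * sum(curv * w**2) with a per-coordinate
    (vector) step-size: AdaGrad scales each coordinate's update by
    the inverse root of its accumulated squared gradients."""
    w = np.ones_like(curv)
    g2 = np.zeros_like(curv)               # running sum of squared gradients
    for _ in range(steps):
        g = curv * w                       # gradient of the quadratic
        g2 += g * g
        # Effective step-size is the vector lr / sqrt(g2): large on the
        # flat coordinate, small on the stiff one. A single scalar
        # step-size stable for curvature 100 (lr < 2/100) would barely
        # move the coordinate with curvature 0.01.
        w -= lr * g / (np.sqrt(g2) + eps)
    return w

curv = np.array([100.0, 1.0, 0.01])        # badly scaled curvatures
w = adagrad_quadratic(curv)
# All three coordinates converge near zero despite the 10^4 curvature spread.
```

The same ill-conditioning argument is what motivates vector step-size adaptation for continual online prediction, where the appropriate scale per weight also drifts over time.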
- Fall 2023: Machine learning applications using neural networks often involve solving discrete optimization problems, such as in pruning, parameter-isolation-based continual learning, and training of binary networks. However, these discrete problems are combinatorial in nature and are not amenable to...