Online Agent Modelling in Human-Scale Problems

Spring 2016

Bard, Nolan DC

Ideal agent behaviour in multiagent environments depends on the behaviour of other agents. Consequently, acting to maximize utility is challenging since an agent must gather and exploit knowledge about how the other (potentially adaptive) agents behave. In this thesis, we investigate how an...
Online Learning for Linearly Parametrized Control Problems

Spring 2013

Abbasi-Yadkori, Yasin

In a discrete-time online control problem, a learner makes an effort to control the state of an initially unknown environment so as to minimize the sum of the losses he suffers, where the losses are assumed to depend on the individual state-transitions. Various models of control problems have...
Online Learning under Partial Feedback

Fall 2016

Wu, Yifan

In an online learning problem a player makes decisions in a sequential manner. In each round, the player receives some reward that depends on his action and an outcome generated by the environment while some feedback information about the outcome is revealed. The goal of the player can be...
Online Off-policy Prediction

Spring 2022

Ghiassian, Sina

In this dissertation, we study online off-policy temporal-difference learning algorithms, a class of reinforcement learning algorithms that can learn predictions in an efficient and scalable manner. The contributions of this dissertation are of two kinds: (1) empirically studying existing...
Online optimization for machine learning: parallelism, adaptivity, and model selection

Fall 2019

Joulani, Pooria

We study three problems in the application, design, and analysis of online optimization algorithms for machine learning. First, we consider speeding up the common task of k-fold cross-validation of online algorithms, and provide TreeCV, an algorithm that reduces the time penalty of k-fold...
Online Prediction of Mid-Flight Aircraft Trajectories with Multi-Timestep Markov Models

Spring 2020

Pan, Yongzhen Arthur

Online trajectory prediction is central to air traffic control's role of improving the flow of air traffic and preventing collisions, particularly given the ever-increasing number of air travellers. In this thesis, we propose an approach to predict the mid-flight trajectory of an...
Online Predictions, RL and Water Treatment: A GVF Story

Fall 2023

Janjua, Muhammad Kamran

We study the use of reinforcement-learning based prediction approaches for a real drinking-water treatment plant. Developing such a prediction system is a critical step on the path to optimizing and automating water treatment. Before that, there are many questions to answer about predictability...
Opera prima: nonobtuse triangulation

1992

Lacesso, Winslowe
Opponent modeling in poker: learning and acting in a hostile and uncertain environment

2002

Davidson, John Aaron
Opponent modelling and search in poker

2006

Schauenberg, Terence Conrad