Search

Filter

Subject / Keyword

Show 4 more ...

Author / Creator / Contributor

Show 4 more ...

Year

Collections

Show 2 more ...

Languages

22English

Item type

Departments

Supervisors

Show 4 more ...

Using Regret Estimation to Solve Games Compactly
Download

Spring 2016

Morrill, Dustin R

Game theoretic solution concepts, such as Nash equilibrium strategies that are optimal against worst case opponents, provide guidance in finding desirable autonomous agent behaviour. In particular, we wish to approximate solutions to complex, dynamic tasks, such as negotiation or bidding in...
Transforming online cultural safety training for self-directed, adult learners
Download

2016-08-26

MacIntyre, Gregory T.
The Role of Information in Online Learning
Download

Fall 2012

Bartók, Gábor

In a partial-monitoring game a player has to make decisions in a sequential manner. In each round, the player suffers some loss that depends on his decision and an outcome chosen by an opponent, after which he receives "some" information about the outcome. The goal of the player is to keep the...
Reinforcement Learning-based Process Control Under Sensory Uncertainty
Download

Spring 2023

Dogru, Oguzhan

Process industries involve processes that have complex, interdependent, and sometimes uncontrollable/unobservable features that are subject to a variety of uncertainties such as operational fluctuations, sensory noises, process anomalies, human involvement, market volatility, and so forth. In the...
Reinforcement Learning Algorithms for MDPs
Download

2009

Szepesvari, Csaba

Technical report TR09-13. This article presents a survey of reinforcement learning algorithms for Markov Decision Processes (MDP). In the first half of the article, the problem of value estimation is considered. Here we start by describing the idea of bootstrapping and temporal difference...
Recommender systems to support socio-collaborative learning in educational discussion forums
Download

Fall 2020

Chen, Zhaorui

With the popularity of online education, many educational technologies have been introduced to support students' learning. Among them, asynchronous discussion forums are widely used to support students’ socio-collaborative learning processes. However, the forum's complex thread structure and...
Online Off-policy Prediction
Download

Spring 2022

Sina Ghiassian

In this dissertation, we study online off-policy temporal-difference learning algorithms, a class of reinforcement learning algorithms that can learn predictions in an efficient and scalable manner. The contributions of this dissertation are one of the two kinds: (1) empirically studying existing...
Online Learning under Partial Feedback
Download

Fall 2016

Wu, Yifan

In an online learning problem a player makes decisions in a sequential manner. In each round, the player receives some reward that depends on his action and an outcome generated by the environment while some feedback information about the outcome is revealed. The goal of the player can be...
Online Agent Modelling in Human-Scale Problems
Download

Spring 2016

Bard, Nolan DC

Ideal agent behaviour in multiagent environments depends on the behaviour of other agents. Consequently, acting to maximize utility is challenging since an agent must gather and exploit knowledge about how the other (potentially adaptive) agents behave. In this thesis, we investigate how an...
On Local Regret
Download

2012

Bowling, Michael, Zinkevich, Martin

Online learning aims to perform nearly as well as the best hypothesis in hindsight. For some hypothesis classes, though, even finding the best hypothesis offline is challenging. In such offline cases, local search techniques are often employed and only local optimality guaranteed. For online...