- 10 Online Learning
- 3 Machine Learning
- 3 Reinforcement Learning
- 2 Game Theory
- 2 Nash Equilibrium
- 1 Abstractions
- 2 Joulani, Pooria
- 1 Abbasi-Yadkori, Yasin
- 1 D'Orazio, Ryan
- 1 Elsayed, Mohamed
- 1 Jacobsen, Andrew
- 1 Morrill, Dustin R
-
Fall 2024
Over the last decade, machine learning (ML) has led to advances in many fields, such as computer vision, online decision-making, robotics, natural language processing, and many others. The algorithms driving these successes typically have one or more user-specified free variables called...
-
Spring 2016
Monte Carlo methods are a simple, effective, and widely deployed way of approximating integrals that prove too challenging for deterministic approaches. This thesis presents a number of contributions to the field of adaptive Monte Carlo methods. That is, approaches that automatically adjust the...
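For readers unfamiliar with the baseline that adaptive methods improve on, here is a minimal sketch of plain (non-adaptive) Monte Carlo integration; the function names and example integrand are illustrative assumptions, not material from the thesis.

```python
# Illustrative sketch only: plain (non-adaptive) Monte Carlo estimation of an
# integral, the baseline that adaptive Monte Carlo methods aim to improve on.
import random
import math

def mc_integral(f, a, b, n=100_000):
    """Estimate the integral of f over [a, b] by averaging f at uniform samples."""
    total = 0.0
    for _ in range(n):
        x = random.uniform(a, b)
        total += f(x)
    return (b - a) * total / n

if __name__ == "__main__":
    # Example: integral of exp(-x^2) on [0, 2]; true value is roughly 0.882
    est = mc_integral(lambda x: math.exp(-x * x), 0.0, 2.0)
    print(f"Monte Carlo estimate: {est:.5f}")
```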
-
Fall 2022
Modern representation learning methods perform well on offline tasks and primarily revolve around batch updates. However, batch updates preclude those methods from focusing on new experience, which is essential for fast online adaptation. In this thesis, we study an online and incremental...
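As a rough illustration of the batch-versus-incremental distinction (not the representation learning methods studied in the thesis), the sketch below contrasts a full-batch gradient step with a per-example online update; the function names and squared-error objective are assumptions.

```python
# Illustrative contrast: a batch update averages gradients over stored data,
# while an incremental update adjusts the weights from each new example as it
# arrives and then discards it.
import numpy as np

def batch_update(w, X, y, lr=0.1):
    """One full-batch gradient step for squared error on all stored data."""
    grad = X.T @ (X @ w - y) / len(y)
    return w - lr * grad

def incremental_update(w, x, target, lr=0.1):
    """Online step from a single new example; no data needs to be stored."""
    error = x @ w - target
    return w - lr * error * x

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X, y = rng.normal(size=(100, 3)), rng.normal(size=100)
    w_batch = batch_update(np.zeros(3), X, y)
    w_online = np.zeros(3)
    for x_t, y_t in zip(X, y):  # stream the same data one example at a time
        w_online = incremental_update(w_online, x_t, y_t)
    print(w_batch, w_online)
```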
-
Learning What to Remember: Strategies for Selective External Memory in Online Reinforcement Learning Agents
Spring 2019
In realistic environments, intelligent agents must learn to integrate information from their past to inform present decisions. An agent's immediate observations are often limited, and some degree of memory is necessary to complete many everyday tasks. However, an agent cannot remember everything...
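One simple selective-write baseline for a fixed-size external memory is reservoir sampling, sketched below; this is only an illustrative strategy under assumed names, not the approach proposed in the thesis.

```python
# A minimal sketch of one selection strategy (reservoir sampling) for a
# fixed-size external memory: the memory keeps a uniform random sample of
# everything the agent has observed so far.
import random

class ReservoirMemory:
    def __init__(self, capacity):
        self.capacity = capacity
        self.slots = []
        self.seen = 0

    def write(self, observation):
        """Decide whether to remember the new observation, and which slot to overwrite."""
        self.seen += 1
        if len(self.slots) < self.capacity:
            self.slots.append(observation)
        else:
            j = random.randrange(self.seen)
            if j < self.capacity:
                self.slots[j] = observation

    def read(self):
        return list(self.slots)

if __name__ == "__main__":
    memory = ReservoirMemory(capacity=5)
    for t in range(1000):
        memory.write(f"obs_{t}")
    print(memory.read())
```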
-
Fall 2012
In this thesis, the multi-armed bandit (MAB) problem in online learning is studied, when the feedback information is not observed immediately but rather after arbitrary, unknown, random delays. In the "stochastic" setting, when the rewards come from a fixed distribution, an algorithm is given that...
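To make the delayed-feedback setting concrete, the sketch below runs an epsilon-greedy bandit whose rewards arrive only after random delays, so value estimates lag behind play; the delay model, learner, and arm means are assumptions, not the algorithm or guarantees from the thesis.

```python
# Illustrative sketch of the delayed-feedback bandit setting: the reward for
# the arm pulled at round t only becomes available after a random delay, and
# the learner can update its estimates only once that feedback arrives.
import random
from collections import defaultdict

def delayed_bandit(means, rounds=5000, eps=0.1, max_delay=50):
    counts = defaultdict(int)      # observed pulls per arm
    totals = defaultdict(float)    # observed reward sums per arm
    pending = []                   # (arrival_round, arm, reward)
    for t in range(rounds):
        # deliver any feedback whose delay has elapsed
        arrived = [p for p in pending if p[0] <= t]
        pending = [p for p in pending if p[0] > t]
        for _, arm, reward in arrived:
            counts[arm] += 1
            totals[arm] += reward
        # epsilon-greedy choice using only the feedback seen so far
        if random.random() < eps or not counts:
            arm = random.randrange(len(means))
        else:
            arm = max(counts, key=lambda a: totals[a] / counts[a])
        reward = 1.0 if random.random() < means[arm] else 0.0
        pending.append((t + random.randint(1, max_delay), arm, reward))
    return {a: totals[a] / counts[a] for a in counts}

if __name__ == "__main__":
    print(delayed_bandit([0.2, 0.5, 0.7]))
```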
-
Spring 2013
In a discrete-time online control problem, a learner makes an effort to control the state of an initially unknown environment so as to minimize the sum of the losses he suffers, where the losses are assumed to depend on the individual state-transitions. Various models of control problems have...
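The interaction protocol described above can be sketched as follows; the dynamics, transition losses, and uniform-random policy are made-up placeholders rather than any model or method from the thesis.

```python
# Minimal sketch of the online control protocol: at each step the learner
# picks an action, the (initially unknown) environment transitions, and the
# learner is charged a loss that depends on that state transition.
import random

def run_control_episode(num_states=4, num_actions=2, horizon=20, seed=0):
    rng = random.Random(seed)
    # Dynamics and transition losses, unknown to the learner (invented here).
    next_state = {(s, a): rng.randrange(num_states)
                  for s in range(num_states) for a in range(num_actions)}
    loss = {(s, s2): rng.random()
            for s in range(num_states) for s2 in range(num_states)}

    state, total_loss = 0, 0.0
    for _ in range(horizon):
        action = rng.randrange(num_actions)     # placeholder policy: act uniformly at random
        new_state = next_state[(state, action)]
        total_loss += loss[(state, new_state)]  # loss is charged on the transition
        state = new_state
    return total_loss

if __name__ == "__main__":
    print(run_control_episode())
```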
-
Fall 2019
We study three problems in the application, design, and analysis of online optimization algorithms for machine learning. First, we consider speeding up the common task of k-fold cross-validation of online algorithms, and provide TreeCV, an algorithm that reduces the time penalty of k-fold...
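For context, the sketch below shows the naive k-fold cross-validation of an online learner, whose roughly k-fold time cost is what TreeCV is designed to reduce; the perceptron learner and synthetic data are illustrative assumptions, and TreeCV itself is not reproduced here.

```python
# Sketch of naive k-fold cross-validation of an online learner: each fold
# trains a fresh learner on the other k-1 folds, so the data is processed
# roughly k times in total.
import numpy as np

def online_perceptron(stream):
    """Train a perceptron in a single online pass over (x, y) pairs with y in {-1, +1}."""
    w = None
    for x, y in stream:
        if w is None:
            w = np.zeros_like(x, dtype=float)
        if y * (w @ x) <= 0:
            w += y * x
    return w

def naive_kfold_cv(X, y, k=5):
    folds = np.array_split(np.arange(len(y)), k)
    errors = []
    for i, test_idx in enumerate(folds):
        train_idx = np.concatenate([f for j, f in enumerate(folds) if j != i])
        w = online_perceptron(zip(X[train_idx], y[train_idx]))
        preds = np.sign(X[test_idx] @ w)
        errors.append(np.mean(preds != y[test_idx]))
    return float(np.mean(errors))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 5))
    y = np.sign(X @ np.array([1.0, -2.0, 0.5, 0.0, 1.5]) + 1e-9)
    print("naive k-fold error:", naive_kfold_cv(X, y))
```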
-
Spring 2013
This work introduces the “online probing” problem: In each round, the learner is able to purchase the values of a subset of features for the current instance. After the learner uses this information to produce a prediction for this instance, it then has the option of paying for seeing the full...
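A single round of the online probing protocol, as described above, might look like the sketch below; the feature costs, the fixed purchased subset, and the always-reveal decision rule are all made-up placeholders used only to show the round structure.

```python
# Sketch of one "online probing" round: buy a subset of feature values, predict
# from that partial view, then optionally pay to see the full instance.
import numpy as np

def probing_round(x_full, w, feature_cost=0.1, full_cost=0.5, purchased=(0, 1)):
    """One round of the probing protocol with a linear predictor."""
    paid = feature_cost * len(purchased)
    x_seen = np.zeros_like(x_full)
    x_seen[list(purchased)] = x_full[list(purchased)]  # purchased entries only
    prediction = float(x_seen @ w)                     # predict from the partial view
    reveal_everything = True                           # placeholder decision rule
    if reveal_everything:
        paid += full_cost                              # pay to see the full instance
        x_seen = x_full.copy()
    return prediction, paid, x_seen

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x, w = rng.normal(size=5), rng.normal(size=5)
    print(probing_round(x, w))
```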
-
Fall 2020
Computing a Nash equilibrium in zero-sum games, or more generally saddle point optimization, is a fundamental problem in game theory and machine learning, with applications spanning a wide variety of domains, from generative modeling and computer vision to super-human AI in imperfect...
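As a small, self-contained example of equilibrium computation in a zero-sum matrix game, the sketch below runs regret matching in self-play, whose averaged strategies approach the game's Nash equilibrium; this is a standard textbook procedure and the example game is invented, not the saddle-point methods studied in the thesis.

```python
# Regret matching in self-play on a zero-sum matrix game: each player mixes in
# proportion to its positive cumulative regrets, and the time-averaged
# strategies converge to a Nash equilibrium (a saddle point) of the game.
import numpy as np

def regret_matching(payoff, iters=50_000):
    """payoff[i, j]: row player's payoff; the column player receives its negation."""
    n, m = payoff.shape
    row_regret, col_regret = np.zeros(n), np.zeros(m)
    row_avg, col_avg = np.zeros(n), np.zeros(m)

    def strategy(regret):
        positive = np.maximum(regret, 0.0)
        total = positive.sum()
        return positive / total if total > 0 else np.full(regret.size, 1.0 / regret.size)

    for _ in range(iters):
        p, q = strategy(row_regret), strategy(col_regret)
        row_avg += p
        col_avg += q
        value = p @ payoff @ q
        row_regret += payoff @ q - value    # how much better each pure row action would have done
        col_regret += value - payoff.T @ p  # same for the column player, who gets -payoff
    return row_avg / iters, col_avg / iters

if __name__ == "__main__":
    # A small zero-sum game whose unique equilibrium mixes roughly 0.4/0.6 for both players.
    game = np.array([[2.0, -1.0], [-1.0, 1.0]])
    p, q = regret_matching(game)
    print("row player:", np.round(p, 3), "column player:", np.round(q, 3))
```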
-
Spring 2016
Game theoretic solution concepts, such as Nash equilibrium strategies that are optimal against worst case opponents, provide guidance in finding desirable autonomous agent behaviour. In particular, we wish to approximate solutions to complex, dynamic tasks, such as negotiation or bidding in...