This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Subject / Keyword

Show 2 more ...

Collections

Author / Creator / Contributor

Year

Languages

3English

Item type

3Thesis

Departments

3Department of Computing Science

Supervisors

Differentially Private Algorithms for Efficient Online Matroid Optimization
Download

Fall 2023

Chandak, Kushagra

A matroid bandit is the online version of combinatorial optimization on a matroid, in which the learner chooses $K$ actions from a set of $L$ actions that can form a matroid basis. Many real-world applications such as recommendation systems can be modeled as matroid bandits. In such learning...
Online Learning under Partial Feedback
Download

Fall 2016

Wu, Yifan

In an online learning problem a player makes decisions in a sequential manner. In each round, the player receives some reward that depends on his action and an outcome generated by the environment while some feedback information about the outcome is revealed. The goal of the player can be...
Perturbed History Exploration in Stochastic Subgaussian Generalized Linear Bandits
Download

Fall 2023

Liu, Shuai

We consider stochastic generalized linear bandit (GLB) problems when the reward distributions are log-concave and subgaussian. We consider for this problem the perturbed history exploration (PHE) algorithmIn each round of its operation, PHE perturbs the observed rewards by adding fresh noise to...

1 - 3 of 3

Search

Items (3)

Collections

Communities

Differentially Private Algorithms for Efficient Online Matroid Optimization

Online Learning under Partial Feedback

Perturbed History Exploration in Stochastic Subgaussian Generalized Linear Bandits