Results for "supervisors_tesim:"Szepesvari, Csaba (Computing Science)""

Differentially Private Algorithms for Efficient Online Matroid Optimization

Fall 2023

Chandak, Kushagra

A matroid bandit is the online version of combinatorial optimization on a matroid, in which the learner chooses $K$ actions from a set of $L$ actions that can form a matroid basis. Many real-world applications such as recommendation systems can be modeled as matroid bandits. In such learning...
Primal-Dual Algorithms for Learning in Constrained Markov Decision Processes

Fall 2023

Liu, Chang

Many real-world tasks in fields such as robotics and control can be formulated as constrained Markov decision processes (CMDPs). In CMDPs, the objective is usually to optimize the return while ensuring that some constraints are satisfied at the same time. The primal-dual approach is a common...
Pure Exploration in Multi-Armed Bandits

Spring 2023

Stephens, Connor J

Many practical problems in fields ranging from online advertising to genomics can be framed as the task of selecting the best option from among several choices, based on a limited number of noisy evaluations of the quality of each choice. Pure exploration in multi-armed bandits is an...
Optimized Batch Policy Evaluation in the Presence of Monotone Responses

Spring 2021

Dong, Wang

In batch policy evaluation the goal is to predict the value of a policy given some historical data. A specific example, which motivated the approach pursued in this thesis, is to predict the probability of putting a natural wildfire out given some specific configuration of dispatched resources,...
Towards Sample Efficient Reinforcement Learning with Function Approximation

Fall 2021

Ayoub, Alex

This thesis proposes novel algorithmic ideas in reinforcement learning for regret minimization. These algorithmic ideas enjoy nice theoretical guarantees and are more practical in large problems than their alternatives. We focus on finite-horizon episodic RL. We propose model-based and model-free...
Vector Step-size Adaptation for Continual, Online Prediction

Fall 2019

Jacobsen, Andrew

In this thesis, we investigate different vector step-size adaptation approaches for continual, online prediction problems. Vanilla stochastic gradient descent can be considerably improved by scaling the update with a vector of appropriately chosen step-sizes. Many methods, including AdaGrad,...
Online optimization for machine learning: parallelism, adaptivity, and model selection

Fall 2019

Joulani, Pooria

We study three problems in the application, design, and analysis of online optimization algorithms for machine learning. First, we consider speeding up the common task of k-fold cross-validation of online algorithms, and provide TreeCV, an algorithm that reduces the time penalty of k-fold...
Convex Latent Modeling

Spring 2017

Aslan, Ozlem

Most machine learning problems can be posed as solving a mathematical program that describes the structure of the prediction problem, usually expressed in terms of carefully chosen losses and regularizers. However, many machine learning problems yield mathematical programs that are not convex in...
Bandit Convex Optimization with Biased Noisy Gradient Oracles

Spring 2017

Hu, Xiaowei

Optimizing an objective function over convex sets is a key problem in many different machine learning models. One of the various kinds of well-studied objective functions is the convex function, where any local minimum must be the global minimum over the domain. To find the optimal point that...
Instance-dependent analysis of learning algorithms

Fall 2017

Huang, Ruitong

On the one hand, theoretical analyses of machine learning algorithms are typically performed based on various probabilistic assumptions about the data. While these probabilistic assumptions are important in the analyses, it is debatable whether such assumptions actually hold in practice. Another...