Search

Filter

Supervisors

Show 3 more ...

Author / Creator / Contributor

Show 4 more ...

Subject / Keyword

Show 4 more ...

Year

Collections

Languages

15English

Item type

15Thesis

Departments

15Department of Computing Science

Online optimization for machine learning: parallelism, adaptivity, and model selection
Download

Fall 2019

Joulani, Pooria

We study three problems in the application, design, and analysis of online optimization algorithms for machine learning. First, we consider speeding-up the common task of k-fold cross-validation of online algorithms, and provide TreeCV, an algorithm that reduces the time penalty of k-fold...
Primal-Dual Algorithms for Learning in Constrained Markov Decision Processes
Download

Fall 2023

Liu, Chang

Many real-world tasks in fields such as robotics and control can be formulated as constrained Markov decision processes (CMDPs). In CMDPs, the objective is usually to optimize the return while ensuring some constraints being satisfied at the same time. The primal-dual approach is a common...
Probe-Efficient Learning
Download

Spring 2013

Zolghadr, Navid

This work introduces the “online probing” problem: In each round, the learner is able to purchase the values of a subset of features for the current instance. After the learner uses this information to produce a prediction for this instance, it then has the option of paying for seeing the full...
Pure Exploration in Multi-Armed Bandits
Download

Spring 2023

Stephens, Connor J

Many practical problems in fields ranging from online advertising to genomics can be framed as the task of selecting the best option from among several choices, based on a limited number of noisy evaluations of the quality of each choice. Pure exploration in multi-armed bandits is an...
Regularization in reinforcement learning
Download

Fall 2011

Farahmand, Amir-massoud

This thesis studies the reinforcement learning and planning problems that are modeled by a discounted Markov Decision Process (MDP) with a large state space and finite action space. We follow the value-based approach in which a function approximator is used to estimate the optimal value function....

11 - 15 of 15