Languages: English (25)

Item type: Thesis (25)

Departments: Department of Computing Science (25)

A general framework for reducing variance in agent evaluation

Spring 2010

White, Martha

In this work, we present a unified, general approach to variance reduction in agent evaluation using machine learning to minimize variance. Evaluating an agent's performance in a stochastic setting is necessary for agent development, scientific evaluation, and competitions. Traditionally,...
Abstraction in Large Extensive Games

Fall 2009

Waugh, Kevin

For zero-sum games, we have efficient solution techniques. Unfortunately, there are interesting games that are too large to solve. Here, a popular approach is to solve an abstract game that models the original game. We assume that more accurate abstract games result in stronger strategies....
Advances in Probabilistic Generative Models: Normalizing Flows, Multi-View Learning, and Linear Dynamical Systems

Fall 2020

Karami, Mahdi

This thesis considers some aspects of generative models including my contributions in deep probabilistic generative architectures and linear dynamical systems. First, some advances in deep probabilistic generative models are contributed. Flow-based generative modelling is an emerging and highly...
Advances in Simulation-Based Search and Batch Reinforcement Learning

Spring 2023

Xiao, Chenjun

Reinforcement learning (RL) defines a general computational problem where the learner must learn to make good decisions through interactive experience. To be effective in solving this problem, the learner must be able to explore the environment, make accurate predictions about the future, and...
Bandit Convex Optimization with Biased Noisy Gradient Oracles

Spring 2017

Hu, Xiaowei

Optimizing an objective function over convex sets is a key problem in many different machine learning models. One of the various kinds of well-studied objective functions is the convex function, where any local minimum must be the global minimum over the domain. To find the optimal point that...
Bregman Divergence Clustering: A Convex Approach

Fall 2013

Cheng, Hao

Due to its wide application in various fields, clustering, as a fundamental unsupervised learning problem, has been intensively investigated over the past few decades. Unfortunately, standard clustering formulations are known to be computationally intractable. Although many convex relaxations of...
Convex Latent Modeling

Spring 2017

Aslan, Ozlem

Most machine learning problems can be posed as solving a mathematical program that describes the structure of the prediction problem, usually expressed in terms of carefully chosen losses and regularizers. However, many machine learning problems yield mathematical programs that are not convex in...
Convex Regression: Theory, Practice, and Applications

Fall 2016

Balazs, Gabor

This thesis explores theoretical, computational, and practical aspects of convex (shape-constrained) regression, providing new excess risk upper bounds, a comparison of convex regression techniques with theoretical guarantees, a novel heuristic training algorithm for max-affine representations,...
Differentially Private Algorithms for Efficient Online Matroid Optimization

Fall 2023

Chandak, Kushagra

A matroid bandit is the online version of combinatorial optimization on a matroid, in which the learner chooses $K$ actions from a set of $L$ actions that can form a matroid basis. Many real-world applications such as recommendation systems can be modeled as matroid bandits. In such learning...
Fast gradient algorithms for structured sparsity

Spring 2014

Yu, Yaoliang

Many machine learning problems can be formulated under the composite minimization framework, which usually involves a smooth loss function and a nonsmooth regularizer. Many algorithms have thus been proposed, and the main focus has been on first-order gradient methods, due to their...