Search
Skip to Search Results- 14Schuurmans, Dale (Computing Science)
- 4Szepesvari, Csaba (Computing Science)
- 3Bowling, Michael (Computing Science)
- 2Greiner, Russell (Computing Science)
- 1Bowling, Mike (Computing Science)
- 1Müller, Martin (Computing Science)
- 5Machine learning
- 3Reinforcement Learning
- 2Machine Learning
- 1Abstractions
- 1Agent evaluation
- 1Artificial intelligence
-
Spring 2010
In this work, we present a unified, general approach to variance reduction in agent evaluation using machine learning to minimize variance. Evaluating an agent's performance in a stochastic setting is necessary for agent development, scientific evaluation, and competitions. Traditionally,...
-
Fall 2009
For zero-sum games, we have efficient solution techniques. Unfortunately, there are interesting games that are too large to solve. Here, a popular approach is to solve an abstract game that models the original game. We assume that more accurate the abstract games result in stronger strategies....
-
Advances in Probabilistic Generative Models: Normalizing Flows, Multi-View Learning, and Linear Dynamical Systems
DownloadFall 2020
This thesis considers some aspects of generative models including my contributions in deep probabilistic generative architectures and linear dynamical systems. First, some advances in deep probabilistic generative models are contributed. Flow-based generative modelling is an emerging and highly...
-
Spring 2023
Reinforcement learning (RL) defines a general computational problem where the learner must learn to make good decisions through interactive experience. To be effective in solving this problem, the learner must be able to explore the environment, make accurate predictions about the future, and...
-
Fall 2013
Due to its wide application in various fields, clustering, as a fundamental unsupervised learning problem, has been intensively investigated over the past few decades. Unfortunately, standard clustering formulations are known to be computationally intractable. Although many convex relaxations of...
-
Spring 2017
Most machine learning problems can be posed as solving a mathematical program that describes the structure of the prediction problem, usually expressed in terms of carefully chosen losses and regularizers. However, many machine learning problems yield mathematical programs that are not convex in...
-
Fall 2016
This thesis explores theoretical, computational, and practical aspects of convex (shape-constrained) regression, providing new excess risk upper bounds, a comparison of convex regression techniques with theoretical guarantee, a novel heuristic training algorithm for max-affine representations,...
-
Fall 2017
On the one hand, theoretical analyses of machine learning algorithms are typically performed based on various probabilistic assumptions about the data. While these probabilistic assumptions are important in the analyses, it is debatable whether such assumptions actually hold in practice. Another...
-
Fall 2012
This thesis provides a description of the cardiac rhythm as a latent chain of heart sound arrivals which occur over time, where each arrival generates a fixed window of observable data that can be described with arbitrary feature functions. This description of the process produces tractable...