Search
Skip to Search Results- 15Szepesvari, Csaba (Computing Science)
- 4Schuurmans, Dale (Computing Science)
- 1Andras Gyorgy (Imperial College London, U.K.)
- 1Bowling, Michael (Computing Science)
- 1Greiner ,Russell (Computing Science)
- 1Greiner, Russell (Computing Science)
- 2Joulani, Pooria
- 1Abbasi-Yadkori, Yasin
- 1Afkanpour, Arash
- 1Aslan,Ozlem
- 1Balazs, Gabor
- 1Chandak, Kushagra
- 4Online Learning
- 3Learning theory
- 3Reinforcement Learning
- 2Machine Learning
- 2Machine learning
- 1Active learning
-
Fall 2023
Many real-world tasks in fields such as robotics and control can be formulated as constrained Markov decision processes (CMDPs). In CMDPs, the objective is usually to optimize the return while ensuring some constraints being satisfied at the same time. The primal-dual approach is a common...
-
Fall 2023
A matroid bandit is the online version of combinatorial optimization on a matroid, in which the learner chooses $K$ actions from a set of $L$ actions that can form a matroid basis. Many real-world applications such as recommendation systems can be modeled as matroid bandits. In such learning...
-
Fall 2019
We study three problems in the application, design, and analysis of online optimization algorithms for machine learning. First, we consider speeding-up the common task of k-fold cross-validation of online algorithms, and provide TreeCV, an algorithm that reduces the time penalty of k-fold...
-
Fall 2017
On the one hand, theoretical analyses of machine learning algorithms are typically performed based on various probabilistic assumptions about the data. While these probabilistic assumptions are important in the analyses, it is debatable whether such assumptions actually hold in practice. Another...
-
Spring 2017
Most machine learning problems can be posed as solving a mathematical program that describes the structure of the prediction problem, usually expressed in terms of carefully chosen losses and regularizers. However, many machine learning problems yield mathematical programs that are not convex in...
-
Spring 2017
Optimizing an objective function over convex sets is a key problem in many different machine learning models. One of the various kinds of well studied objective functions is the convex function, where any local minimum must be the global mini- mum over the domain. To find the optimal point that...
-
Fall 2016
This thesis explores theoretical, computational, and practical aspects of convex (shape-constrained) regression, providing new excess risk upper bounds, a comparison of convex regression techniques with theoretical guarantee, a novel heuristic training algorithm for max-affine representations,...
-
Spring 2013
In a discrete-time online control problem, a learner makes an effort to control the state of an initially unknown environment so as to minimize the sum of the losses he suffers, where the losses are assumed to depend on the individual state-transitions. Various models of control problems have...