On Local Regret

2012

Bowling, Michael; Zinkevich, Martin

Online learning aims to perform nearly as well as the best hypothesis in hindsight. For some hypothesis classes, though, even finding the best hypothesis offline is challenging. In such offline cases, local search techniques are often employed and only local optimality guaranteed. For online...
Optimal Mechanisms for Machine Learning: A Game-Theoretic Approach to Designing Machine Learning Competitions

Spring 2013

Ajallooeian, Mohammad Mahdi

In this thesis we consider problems where a self-interested entity, called the principal, has private access to some data that she wishes to use to solve a prediction problem by outsourcing the development of the predictor to some other parties. Assuming the principal, who needs the machine...
Outcome Prediction and Hierarchical Models in Real-Time Strategy Games

Spring 2019

Stanescu, Adrian M

For many years, traditional board games such as Chess, Checkers or Go have been the standard environments to test new Artificial Intelligence (AI) algorithms for achieving robust game-playing agents capable of defeating the best human players. Presently, the focus has shifted...
Pinball: High-Speed Real-Time Tracking and Playing

Fall 2011

Metcalf, Adam

Pinball is a fast-paced arcade-style game whose origins date back hundreds of years. Game-playing robots exist for billiards, foosball, and soccer, and each has its own unique challenges. The speed at which balls move in pinball machines requires that players have quick reactions. We created...
Policy Selection for Transfer Learning in the Building Control Domain

Fall 2023

Krishna Guruvayur Sasikumar, Aakash

The application of reinforcement learning (RL) to the optimal control of building systems has gained traction in recent years, as it can reduce building energy consumption and improve human comfort without requiring knowledge of the building model. However, existing RL solutions for building...
Proceedings of Quantum Computing Summer School

2002

Fortin, David; Antoniu, Angela; Sardarli, Arzu; Rezania, Vahid; Levner, Ilya; Bulitko, Vadim

Technical report TR02-14. The 2002 Quantum Computing Summer School (QCSS'02) at the University of Alberta was organized as a learning and discussion forum for researchers in Artificial Intelligence, Computer Science, Physics, Mathematics, and Engineering. The short-term objective was to introduce...
Question Answering for Biomedicine

Fall 2016

Liu, Yifeng

The field of biomedicine is reeling from “information overload”. Indeed, biomedical researchers find it almost impossible to stay current with published literature due to the vast amounts of data being generated and published. As a result, they are turning to text mining. Over the past two...
Regret Minimization in Games and the Development of Champion Multiplayer Computer Poker-Playing Agents

Spring 2014

Gibson, Richard G

Recently, poker has emerged as a popular domain for investigating decision problems under conditions of uncertainty. Unlike traditional games such as checkers and chess, poker exhibits imperfect information, varying utilities, and stochastic events. Because of these complications, decisions at...
Regret Minimization in Games with Incomplete Information

2007

Bowling, Michael; Johanson, Michael; Zinkevich, Martin; Piccione, Carmelo

Technical report TR07-14. Extensive games are a powerful model of multiagent decision-making scenarios with incomplete information. Finding a Nash equilibrium for very large instances of these games has received a great deal of recent attention. In this paper, we describe a new technique for...
Reinforcement Learning Algorithms for MDPs

2009

Szepesvari, Csaba

Technical report TR09-13. This article presents a survey of reinforcement learning algorithms for Markov Decision Processes (MDP). In the first half of the article, the problem of value estimation is considered. Here we start by describing the idea of bootstrapping and temporal difference...