-
2012
Bowling, Michael, Zinkevich, Martin
Online learning aims to perform nearly as well as the best hypothesis in hindsight. For some hypothesis classes, though, even finding the best hypothesis offline is challenging. In such offline cases, local search techniques are often employed and only local optimality is guaranteed. For online...
-
Optimal Mechanisms for Machine Learning: A Game-Theoretic Approach to Designing Machine Learning Competitions
Spring 2013
In this thesis we consider problems where a self-interested entity, called the principal, has private access to some data that she wishes to use to solve a prediction problem by outsourcing the development of the predictor to some other parties. Assuming the principal, who needs the machine...
-
Fall 2011
Pinball is a fast-paced arcade-style game whose origins date back hundreds of years. Game-playing robots exist for billiards, foosball, and soccer, and each has its own unique challenges. The speed at which balls move in pinball machines requires players to have quick reactions. We created...
-
Fall 2023
Krishna Guruvayur Sasikumar, Aakash
The application of reinforcement learning (RL) to the optimal control of building systems has gained traction in recent years, as it can reduce building energy consumption and improve human comfort without requiring knowledge of the building model. However, existing RL solutions for building...
-
2002
Fortin, David, Antoniu, Angela, Sardarli, Arzu, Rezania, Vahid, Levner, Ilya, Bulitko, Vadim
Technical report TR02-14. The 2002 Quantum Computing Summer School (QCSS'02) at the University of Alberta was organized as a learning and discussion forum for researchers in Artificial Intelligence, Computer Science, Physics, Mathematics, and Engineering. The short-term objective was to introduce...
-
Fall 2016
The field of biomedicine is reeling from “information overload”. Indeed, biomedical researchers find it almost impossible to stay current with published literature due to the vast amounts of data being generated and published. As a result, they are turning to text mining. Over the past two...
-
Regret Minimization in Games and the Development of Champion Multiplayer Computer Poker-Playing Agents
Spring 2014
Recently, poker has emerged as a popular domain for investigating decision problems under conditions of uncertainty. Unlike traditional games such as checkers and chess, poker exhibits imperfect information, varying utilities, and stochastic events. Because of these complications, decisions at...
-
2007
Bowling, Michael, Johanson, Michael, Zinkevich, Martin, Piccione, Carmelo
Technical report TR07-14. Extensive games are a powerful model of multiagent decision-making scenarios with incomplete information. Finding a Nash equilibrium for very large instances of these games has received a great deal of recent attention. In this paper, we describe a new technique for...