Search
Skip to Search Results- 5Regret minimization
- 4Game theory
- 2Poker
- 1Artificial Intelligence
- 1Artificial intelligence
- 1Computer Games
- 2Bowling, Michael
- 2Zinkevich, Martin
- 1Gibson, Richard G
- 1Johanson, Michael
- 1Lanctot, Marc
- 1MacQueen, Revan
-
Fall 2023
Self-play is a technique for machine learning in multi-agent systems where a learning algorithm learns by interacting with copies of itself. Self-play is useful for generating large quantities of data for learning, but has the drawback that agents the learner will face post-training may have...
-
Monte Carlo Sampling and Regret Minimization for Equilibrium Computation and Decision-Making in Large Extensive Form Games
DownloadSpring 2013
In this thesis, we investigate the problem of decision-making in large two-player zero-sum games using Monte Carlo sampling and regret minimization methods. We demonstrate four major contributions. The first is Monte Carlo Counterfactual Regret Minimization (MCCFR): a generic family of...
-
2012
Bowling, Michael, Zinkevich, Martin
Online learning aims to perform nearly as well as the best hypothesis in hindsight. For some hypothesis classes, though, even finding the best hypothesis offline is challenging. In such offline cases, local search techniques are often employed and only local optimality guaranteed. For online...
-
Regret Minimization in Games and the Development of Champion Multiplayer Computer Poker-Playing Agents
DownloadSpring 2014
Recently, poker has emerged as a popular domain for investigating decision problems under conditions of uncertainty. Unlike traditional games such as checkers and chess, poker exhibits imperfect information, varying utilities, and stochastic events. Because of these complications, decisions at...
-
2007
Bowling, Michael, Johanson, Michael, Zinkevich, Martin, Piccione, Carmelo
Technical report TR07-14. Extensive games are a powerful model of multiagent decision-making scenarios with incomplete information. Finding a Nash equilibrium for very large instances of these games has received a great deal of recent attention. In this paper, we describe a new technique for...