Search

Filter

Subject / Keyword

Show 4 more ...

Languages

5English

Author / Creator / Contributor

1Piccione, Carmelo

Show 1 more ...

Year

Collections

Item type

Departments

3Department of Computing Science

Supervisors

Guarantees for Self-Play via Polymatrix Decomposability
Download

Fall 2023

MacQueen, Revan

Self-play is a technique for machine learning in multi-agent systems where a learning algorithm learns by interacting with copies of itself. Self-play is useful for generating large quantities of data for learning, but has the drawback that agents the learner will face post-training may have...
Monte Carlo Sampling and Regret Minimization for Equilibrium Computation and Decision-Making in Large Extensive Form Games
Download

Spring 2013

Lanctot, Marc

In this thesis, we investigate the problem of decision-making in large two-player zero-sum games using Monte Carlo sampling and regret minimization methods. We demonstrate four major contributions. The first is Monte Carlo Counterfactual Regret Minimization (MCCFR): a generic family of...
On Local Regret
Download

2012

Bowling, Michael, Zinkevich, Martin

Online learning aims to perform nearly as well as the best hypothesis in hindsight. For some hypothesis classes, though, even finding the best hypothesis offline is challenging. In such offline cases, local search techniques are often employed and only local optimality guaranteed. For online...
Regret Minimization in Games and the Development of Champion Multiplayer Computer Poker-Playing Agents
Download

Spring 2014

Gibson, Richard G

Recently, poker has emerged as a popular domain for investigating decision problems under conditions of uncertainty. Unlike traditional games such as checkers and chess, poker exhibits imperfect information, varying utilities, and stochastic events. Because of these complications, decisions at...
Regret Minimization in Games with Incomplete Information
Download

2007

Bowling, Michael, Johanson, Michael, Zinkevich, Martin, Piccione, Carmelo

Technical report TR07-14. Extensive games are a powerful model of multiagent decision-making scenarios with incomplete information. Finding a Nash equilibrium for very large instances of these games has received a great deal of recent attention. In this paper, we describe a new technique for...