This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.
Search
Skip to Search Results- 2Atari 2600
- 2UCT
- 1Artificial Intelligence
- 1Collusion detection
- 1Collusion prevention
- 1Extensive form game
-
Fall 2012
Collusion is the deliberate cooperation of two or more parties to the detriment of others. While this behaviour can be highly profitable for colluders (for example, in auctions and online games), it is considered illegal and unfair in many sequential decision-making domains and presents many...
-
Spring 2010
This research focuses on developing AI agents that play arbitrary Atari 2600 console games without having any game-specific assumptions or prior knowledge. Two main approaches are considered: reinforcement learning based methods and search based methods. The RL-based methods use feature vectors...
-
Monte Carlo Sampling and Regret Minimization for Equilibrium Computation and Decision-Making in Large Extensive Form Games
DownloadSpring 2013
In this thesis, we investigate the problem of decision-making in large two-player zero-sum games using Monte Carlo sampling and regret minimization methods. We demonstrate four major contributions. The first is Monte Carlo Counterfactual Regret Minimization (MCCFR): a generic family of...
-
Spring 2015
In this thesis, I study the problem of Monte-Carlo Planning in deterministic do- mains with sparse rewards. A popular algorithm in this suite, UCT, is studied. A new algorithm to incorporate state generalization in UCT using estimates of sim- ilar nodes and a distance metric is presented. The...
-
Fall 2018
Dyna is an architecture for reinforcement learning agents that interleaves planning, acting, and learning in an online setting. This architecture aims to make fuller use of limited experience to achieve better performance with fewer environmental interactions. Dyna has been well studied in...