Search
Skip to Search Results- 85Artificial Intelligence
- 22Machine Learning
- 21Game theory
- 10Computer Games
- 10Reinforcement Learning
- 8Planning
- 4Müller, Martin
- 3Bowling, Michael
- 3Johanson, Michael
- 3Lanctot, Marc
- 3Mueller, Martin
- 3Zinkevich, Martin
- 65Graduate and Postdoctoral Studies (GPS), Faculty of
- 65Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations
- 21Computing Science, Department of
- 21Computing Science, Department of/Technical Reports (Computing Science)
- 4Toolkit for Grant Success
- 4WISEST Summer Research Program
-
Fall 2013
Given nothing but the generative model of the environment, Monte Carlo Tree Search techniques have recently shown spectacular results on domains previously thought to be intractable. In this thesis we try to develop generic techniques for temporal abstraction inside MCTS that would allow the...
-
Spring 2018
Decision-making problems with two agents can be modeled as two player games, and a Nash equilibrium is the basic solution concept describing good play in adversarial games. Computing this equilibrium solution for imperfect information games, where players have private, hidden information, is...
-
Fall 2022
Medical Fake News is a pervasive part of the information that people consume on the internet. It may lead people to take actions which may put the lives of their family and community in danger - such actions include vaccine hesitancy, administering unverified and harmful treatments, etc. First...
-
Spring 2017
With the growing population of the elderly and the decline of population growth rate, developed countries are facing problems in taking care of their elderly. One of the issues that is becoming more severe is the issue of companionship for the aged people, particularly those who chose to live...
-
Fall 2023
The increasing popularity of Deep Neural Networks (DNN) has led to their application to many domains, including Music Generation. However, these large DNN-based models are heavily dependent on their training dataset, which means they perform poorly on musical genres that are out-of-distribution...
-
2019-10-01
SSHRC IG awarded 2020: The global economy is on the verge of a profound transformation as artificial intelligence (AI) achieves and exceeds human-level abilities in a growing number of domains. Canada is already a world leader in the development and commercialization of AI technologies. However,...
-
Fall 2020
This thesis is offered as a step forward in our understanding of forgetting in artificial neural networks. ANNs are a learning system loosely based on our understanding of the brain and are responsible for recent breakthroughs in artificial intelligence. However, they have been reported to be...
-
Spring 2020
Reinforcement learning (RL) is a powerful learning paradigm in which agents can learn to maximize sparse and delayed reward signals. Although RL has had many impressive successes in complex domains, learning can take hours, days, or even years of training data. A major challenge of contemporary...