Search
Skip to Search Results- 84Artificial Intelligence
- 22Machine Learning
- 10Reinforcement Learning
- 8Planning
- 7Computer Games
- 7Computing Science
- 4Müller, Martin
- 3Mueller, Martin
- 2Johanson, Michael
- 2Nakhost, Hootan
- 2Pelletier, Francis J.
- 2Schaeffer, Jonathan
- 50Graduate and Postdoctoral Studies (GPS), Faculty of
- 50Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations
- 18Computing Science, Department of
- 18Computing Science, Department of/Technical Reports (Computing Science)
- 4WISEST Summer Research Program
- 4WISEST Summer Research Program/WISEST Research Posters
- 50Thesis
- 20Report
- 7Research Material
- 3Conference/Workshop Poster
- 2Review
- 1Conference/Workshop Presentation
-
Strengths, Weaknesses, and Combinations of Model-based and Model-free Reinforcement Learning
DownloadSpring 2016
Reinforcement learning algorithms are conventionally divided into two approaches: a model-based approach that builds a model of the environment and then computes a value function from the model, and a model-free approach that directly estimates the value function. The first contribution of this...
-
2011
Technical report TR11-01. Causality is a fundamental concept in reasoning. The effectiveness of many reasoning tasks depends on the understanding of the underlying cause-effect relationships. Therefore, the notion of causality has been explored in a wide range of disciplines. Causal discovery,...
-
Fall 2013
Given nothing but the generative model of the environment, Monte Carlo Tree Search techniques have recently shown spectacular results on domains previously thought to be intractable. In this thesis we try to develop generic techniques for temporal abstraction inside MCTS that would allow the...
-
Fall 2022
Medical Fake News is a pervasive part of the information that people consume on the internet. It may lead people to take actions which may put the lives of their family and community in danger - such actions include vaccine hesitancy, administering unverified and harmful treatments, etc. First...
-
Spring 2017
With the growing population of the elderly and the decline of population growth rate, developed countries are facing problems in taking care of their elderly. One of the issues that is becoming more severe is the issue of companionship for the aged people, particularly those who chose to live...
-
Fall 2023
The increasing popularity of Deep Neural Networks (DNN) has led to their application to many domains, including Music Generation. However, these large DNN-based models are heavily dependent on their training dataset, which means they perform poorly on musical genres that are out-of-distribution...
-
2019-10-01
SSHRC IG awarded 2020: The global economy is on the verge of a profound transformation as artificial intelligence (AI) achieves and exceeds human-level abilities in a growing number of domains. Canada is already a world leader in the development and commercialization of AI technologies. However,...
-
Fall 2020
This thesis is offered as a step forward in our understanding of forgetting in artificial neural networks. ANNs are a learning system loosely based on our understanding of the brain and are responsible for recent breakthroughs in artificial intelligence. However, they have been reported to be...
-
Spring 2020
Reinforcement learning (RL) is a powerful learning paradigm in which agents can learn to maximize sparse and delayed reward signals. Although RL has had many impressive successes in complex domains, learning can take hours, days, or even years of training data. A major challenge of contemporary...