Search

Filter

Subject / Keyword

Show 4 more ...

Author / Creator / Contributor

Show 4 more ...

Year

Collections

Show 4 more ...

Languages

Item type

1Learning Object

Show 1 more ...

Departments

Supervisors

Show 4 more ...

Strengths, Weaknesses, and Combinations of Model-based and Model-free Reinforcement Learning
Download

Spring 2016

Asadi Atui, Kavosh

Reinforcement learning algorithms are conventionally divided into two approaches: a model-based approach that builds a model of the environment and then computes a value function from the model, and a model-free approach that directly estimates the value function. The first contribution of this...
Structure Learning of Causal Bayesian Networks: A Survey
Download

2011

Mahmood, Ashique

Technical report TR11-01. Causality is a fundamental concept in reasoning. The effectiveness of many reasoning tasks depends on the understanding of the underlying cause-effect relationships. Therefore, the notion of causality has been explored in a wide range of disciplines. Causal discovery,...
Temporal Abstraction in Monte Carlo Tree Search
Download

Fall 2013

Vafadost, Mostafa

Given nothing but the generative model of the environment, Monte Carlo Tree Search techniques have recently shown spectacular results on domains previously thought to be intractable. In this thesis we try to develop generic techniques for temporal abstraction inside MCTS that would allow the...
The Nature of Decision-Making: Human Behavior vs. Machine Learning
Download

2019-01-01

Beausoleil, Keeya
Towards Checking Veracity of Medical Claims
Download

Fall 2022

Dhankar, Abhishek

Medical Fake News is a pervasive part of the information that people consume on the internet. It may lead people to take actions which may put the lives of their family and community in danger - such actions include vaccine hesitancy, administering unverified and harmful treatments, etc. First...
Towards the Implementation of an Intelligent Software Agent for the Elderly
Download

Spring 2017

Faghih Dinevari, Amir Hossein

With the growing population of the elderly and the decline of population growth rate, developed countries are facing problems in taking care of their elderly. One of the issues that is becoming more severe is the issue of companionship for the aged people, particularly those who chose to live...
Transfer Learning for Underrepresented Music Generation
Download

Fall 2023

Doosti Sanjani, Anahita

The increasing popularity of Deep Neural Networks (DNN) has led to their application to many domains, including Music Generation. However, these large DNN-based models are heavily dependent on their training dataset, which means they perform poorly on musical genres that are out-of-distribution...
Understanding Consumer and Employee Reactions to AI Technologies

2019-10-01

Castelo, Noah

SSHRC IG awarded 2020: The global economy is on the verge of a profound transformation as artificial intelligence (AI) achieves and exceeds human-level abilities in a growing number of domains. Canada is already a world leader in the development and commercialization of AI technologies. However,...
Understanding Forgetting in Artificial Neural Networks
Download

Fall 2020

Ashley, Dylan R

This thesis is offered as a step forward in our understanding of forgetting in artificial neural networks. ANNs are a learning system loosely based on our understanding of the brain and are responsible for recent breakthroughs in artificial intelligence. However, they have been reported to be...
Useful Policy Invariant Shaping from Arbitrary Advice
Download

Spring 2020

Behboudian, Paniz

Reinforcement learning (RL) is a powerful learning paradigm in which agents can learn to maximize sparse and delayed reward signals. Although RL has had many impressive successes in complex domains, learning can take hours, days, or even years of training data. A major challenge of contemporary...