Search
Skip to Search Results- 70Reinforcement Learning
- 15Machine Learning
- 7Artificial Intelligence
- 6Transfer Learning
- 5Planning
- 5Representation Learning
- 1Abbasi-Yadkori, Yasin
- 1Aghakasiri, Kiarash
- 1Alikhasi, Mahdi
- 1Asadi Atui, Kavosh
- 1Banafsheh Rafiee
- 1Behboudian, Paniz
-
Fall 2022
OpenSpiel is an open-source software system for implementing high-performance software players for many different computer games. Hex is a two-player game of perfect information used in a variety of computer games research projects. The OpenSpiel project has implemented a version of the AlphaZero...
-
Feature Generalization in Deep Reinforcement Learning: An Investigation into Representation Properties
DownloadFall 2022
In this thesis, we investigate the connection between the properties and the generalization performance of representations learned by deep reinforcement learning algorithms. Much of the earlier work on representation learning for reinforcement learning focused on designing fixed-basis...
-
Spring 2010
This research focuses on developing AI agents that play arbitrary Atari 2600 console games without having any game-specific assumptions or prior knowledge. Two main approaches are considered: reinforcement learning based methods and search based methods. The RL-based methods use feature vectors...
-
Fall 2022
This thesis investigates a new approach to model-based reinforcement learning using background planning: mixing (approximate) dynamic programming updates and model-free updates, similar to the Dyna architecture. Background planning with learned models is often worse than model-free alternatives,...
-
Fall 2011
We present a new family of gradient temporal-difference (TD) learning methods with function approximation whose complexity, both in terms of memory and per-time-step computation, scales linearly with the number of learning parameters. TD methods are powerful prediction techniques, and with...
-
Improving the reliability of reinforcement learning algorithms through biconjugate Bellman errors
DownloadSpring 2024
In this thesis, we seek to improve the reliability of reinforcement learning algorithms for nonlinear function approximation. Semi-gradient temporal difference (TD) update rules form the basis of most state-of-the-art value function learning systems despite clear counterexamples proving their...
-
Fall 2022
We have witnessed the rising popularity of real-world applications of reinforcement learning (RL). However, most successful real-world applications of RL rely on high-fidelity simulators that enable rapid iteration of prototypes, hyperparameter selection and policy training. On the other hand, RL...
-
Fall 2020
Communication is essential for coordination among humans and animals. Therefore, with the introduction of intelligent agents into the world, agent-to-agent and agent-to-human communication become necessary. Ideally, these agents should be trained in an incremental and decentralized manner. In...
-
Spring 2021
This thesis investigates the use of general value functions for detecting anomalous behavior in machines. Identifying abnormal behavior is critical for ensuring the safety and reliability of any machine or industrial process. When the cause of these anomalies is due to accumulated wear on...
-
Interrelating Prediction and Control Objectives in Episodic Actor-Critic Reinforcement Learning
DownloadFall 2020
The reinforcement learning framework provides a simple way to study computational intelligence as the interaction between an agent and an environment. The goal of an agent is to accrue as much reward as possible by intelligently choosing actions given states. This problem of finding a policy that...