-
Improving the reliability of reinforcement learning algorithms through biconjugate Bellman errors
Spring 2024
In this thesis, we seek to improve the reliability of reinforcement learning algorithms for nonlinear function approximation. Semi-gradient temporal difference (TD) update rules form the basis of most state-of-the-art value function learning systems despite clear counterexamples proving their...
-
Fall 2022
We have witnessed the rising popularity of real-world applications of reinforcement learning (RL). However, most successful real-world applications of RL rely on high-fidelity simulators that enable rapid iteration of prototypes, hyperparameter selection and policy training. On the other hand, RL...
-
Fall 2020
Communication is essential for coordination among humans and animals. Therefore, with the introduction of intelligent agents into the world, agent-to-agent and agent-to-human communication become necessary. Ideally, these agents should be trained in an incremental and decentralized manner. In...
-
Fall 2022
This thesis targets the output tracking problem for payload position and quadrotor yaw in a slung load system (SLS). In spite of its relatively extensive literature, full SLS control is still a challenging problem since its dimension, nonlinearity, and multiple sources of disturbances are not easy...
-
Spring 2021
This thesis investigates the use of general value functions for detecting anomalous behavior in machines. Identifying abnormal behavior is critical for ensuring the safety and reliability of any machine or industrial process. When the cause of these anomalies is due to accumulated wear on...
-
Interrelating Prediction and Control Objectives in Episodic Actor-Critic Reinforcement Learning
Fall 2020
The reinforcement learning framework provides a simple way to study computational intelligence as the interaction between an agent and an environment. The goal of an agent is to accrue as much reward as possible by intelligently choosing actions given states. This problem of finding a policy that...
-
Fall 2021
Powered by advancements in information and Internet technologies, network-based applications have developed rapidly in recent years. Meanwhile, it is recognized that more attention needs to be paid to the issue of cybersecurity. The security of the network environment plays a vital...
-
Fall 2024
Experience replay, the reuse of past data to improve sample efficiency, is ubiquitous in reinforcement learning. Though a variety of smart sampling schemes have been introduced to improve performance, uniform sampling by far remains the most common approach. One exception is Prioritized...
-
Fall 2024
If we aspire to design algorithms that can run for long periods, continually adapting to new, unexpected situations, then we must be willing to deploy our agents without tuning their hyperparameters over the agent’s entire lifetime. The standard practice in deep RL—and even continual RL—is to...