Search

Filter

Subject / Keyword

Show 4 more ...

Author / Creator / Contributor

Show 4 more ...

Year

Collections

Show 2 more ...

Languages

Item type

Departments

Supervisors

Show 4 more ...

Low-Level control of small scale helicopter using Soft Actor-Critic method
Download

Fall 2021

Kamyab, Majid

Unmanned Aerial Vehicles (UAVs), or drones, have been employed in a variety of applications, ranging from surveillance to emergency operations. These systems comprise an ”inner loop” that provides stability and control and an ”outer loop” in charge of mission-level tasks, such as way-point...
Machine Learning and Deep Learning for Modeling and Control of Internal Combustion Engines
Download

Fall 2022

Norouzi Yengeje, Armin

Internal Combustion Engines (ICEs) are ubiquitous; they power a wide range of systems. The broad use of ICEs globally causes more than 20% of the total greenhouse gas emissions. In many countries, emission legislation is transitioning from certification using only traditional chassis dynomometer...
Methodical Advice Collection and Reuse in Deep Reinforcement Learning
Download

Spring 2022

Sahir

Reinforcement learning (RL) has shown great success in solving many challenging tasks via the use of deep neural networks. Although the use of deep learning for RL brings immense representational power to the arsenal, it also causes sample inefficiency. This means that the algorithms are...
Monte Carlo Tree Search and Model Uncertainty
Download

Fall 2022

Kohankhaki, Farnaz

Monte Carlo Tree Search (MCTS) is a popular tree search framework for choos- ing actions in decision-making problems. MCTS is traditionally applied to applications in which a perfect simulation model is available. However, when the model is imperfect, the performance of MCTS drops heavily. In...
Monte Carlo Tree Search in the Presence of Model Uncertainty
Download

Fall 2022

Aghakasiri, Kiarash

Monte Carlo Tree Search (MCTS) is an extremely successful search-based frame- work for decision making. With an accurate simulator of the environment’s dynamics, it can achieve great performance in many games and non-games applications. However, without a perfect simulator, the performance...
No More Pesky Hyperparameters: Offline Hyperparameter Tuning For Reinforcement Learning
Download

Fall 2021

Sakhadeo, Archit

The performance of reinforcement learning (RL) agents is sensitive to the choice of hyperparameters. In real-world settings like robotics or industrial control systems, however, testing different hyperparameter configurations directly on the environment can be financially prohibitive, dangerous,...
Non-uniform Analysis for Non-convex Optimization in Machine Learning
Download

Fall 2021

Mei, Jincheng

The optimization of non-convex objective functions is a topic of central interest in machine learning. Remarkably, it has recently been shown that simple gradient-based optimization can achieve globally optimal solutions in important non-convex problems that arise in machine learning, including...
On Efficient Planning in Large Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning
Download

Fall 2023

Tkachuk, Volodymyr

A practical challenge in reinforcement learning is large action spaces that make planning computationally demanding. For example, in cooperative multi-agent reinforcement learning, a potentially large number of agents jointly optimize a global reward function, which leads to a blow-up in the...
On the Application of Continuous Deterministic Reinforcement Learning in Neural Architecture Search
Download

Spring 2021

Mills, Keith G.

Architecture evaluation is a major bottleneck of Neural Architecture Search (NAS). Recent trends have seen a shift in favor of weight-sharing networks capable of superimposing all possible candidate architectures in a search space. Nevertheless, this technique is not beyond reproach, and has...
On the benefits of sparsity in value function approximators for Reinforcement Learning
Download

Spring 2024

Davelouis Gallardo, Fatima D

In machine learning, sparse neural networks provide higher computational efficiency and in some cases, can perform just as well as fully-connected networks. In the online and incremental reinforcement learning (RL) problem, Prediction Adapted Networks (Martin and Modayil, 2021) is an algorithm...