Improving Sample Efficiency of Online Temporal Difference Learning

Fall 2021

Pan, Yangchen

A common scientific challenge for putting a reinforcement learning agent into practice is how to improve sample efficiency as much as possible with limited computational or memory resources. Such available physical resources may vary in different applications. My thesis introduces some approaches...
Incremental Off-policy Reinforcement Learning Algorithms

Fall 2017

Mahmood, Ashique

Model-free off-policy temporal-difference (TD) algorithms form a powerful component of scalable predictive knowledge representation due to their ability to learn numerous counterfactual predictions in a computationally scalable manner. In this dissertation, we address and overcome two...
Model-based Reinforcement Learning with State and Action Abstractions

Spring 2016

Yao, Hengshuai

In model-based reinforcement learning a model is learned which is then used to find good actions. What model to learn? We investigate these questions in the context of two different approaches to model-based reinforcement learning. We also investigate how one should learn and plan when the reward...
Model-Free Intelligent Diabetes Management Using Machine Learning

Spring 2014

Bastani, Meysam

Each patient with Type-1 diabetes must decide how much insulin to inject before each meal to maintain an acceptable level of blood glucose. The actual injection dose is based on a formula that takes current blood glucose level and the meal size into consideration. While following this insulin...
Reinforcement Learning and Simulation-Based Search in Computer Go

Fall 2009

Silver, David

Learning and planning are two fundamental problems in artificial intelligence. The learning problem can be tackled by reinforcement learning methods, such as temporal-difference learning, which update a value function from real experience, and use function approximation to generalise across...
Statistical analysis of L1-penalized linear estimation with applications

Spring 2012

Ávila Pires, Bernardo

We study linear estimation based on perturbed data when performance is measured by a matrix norm of the expected residual error, in particular, the case in which there are many unknowns, but the “best” estimator is sparse, or has small L1-norm. We propose a Lasso-like procedure that finds the...
Teaching a Powered Prosthetic Arm with an Intact Arm Using Reinforcement Learning

Fall 2017

Vasan, Gautham

The idea of an amputee playing the piano with all the flair and grace of an able-handed person may seem like a futuristic fantasy. While many prosthetic limbs look lifelike, finding one that also moves naturally has proved more of a challenge for both researchers and amputees. Even though...
Towards Prosthetic Arms as Wearable Intelligent Robots

Fall 2015

Sherstan, Craig

The control of powered prosthetic arms has been researched for over 50 years, yet prosthetic control remains an open problem, not just from a research perspective, but from a clinical perspective as well. Significant advances have been made in the manufacture of highly functional prosthetic...
Using behaviour patterns to generate scripts for computer role-playing games

Fall 2009

Cutumisu, Maria

Character behaviours in computer role-playing games have a significant impact on game-play, but are often difficult for story authors to implement and modify. Many computer games use custom scripts to control the behaviours of non-player characters (NPCs). Therefore, a story author must write...