Improving the reliability of reinforcement learning algorithms through biconjugate Bellman errors

Spring 2024

Patterson, Andrew

In this thesis, we seek to improve the reliability of reinforcement learning algorithms for nonlinear function approximation. Semi-gradient temporal difference (TD) update rules form the basis of most state-of-the-art value function learning systems despite clear counterexamples proving their...
Improving Water Treatment Using Reinforcement Learning

Fall 2022

Liu, Puer

We have witnessed the rising popularity of real-world applications of reinforcement learning (RL). However, most successful real-world applications of RL rely on high-fidelity simulators that enable rapid iteration of prototypes, hyperparameter selection and policy training. On the other hand, RL...
In search of breast carcinoma: a mammographic asymmetry approach

1990

Lau, Tin-Kit.
Incremental 3D Line Segments Extraction for Surface Reconstruction from Semi-dense SLAM

Fall 2018

He, Shida

Semi-dense SLAM systems have become popular in the last few years. They can produce much denser point clouds than sparse SLAM while being computationally efficient (using only CPU). In previous works, the surface of the viewed scene was reconstructed in real-time by combining sparse SLAM system...
Incremental free-space carving for real-time 3D reconstruction

Fall 2011

Lovi, David Israel

This thesis addresses the problem of automatic real-time 3D reconstruction of general scenes from monocular video. Whereas many impressively accurate reconstruction techniques exist in the multi-view stereo literature, most are slow offline batch methods designed to work in highly calibrated...
Incremental least-squares temporal difference learning

2007

Geramifard, Alborz
Incremental Off-policy Reinforcement Learning Algorithms

Fall 2017

Mahmood, Ashique

Model-free off-policy temporal-difference (TD) algorithms form a powerful component of scalable predictive knowledge representation due to their ability to learn numerous counterfactual predictions in a computationally scalable manner. In this dissertation, we address and overcome two...
Indexing and Querying Natural Language Text

Spring 2012

Chubak, Pirooz

Natural language text is a prominent source of representing and communicating information and knowledge. It is often desirable to search in granularities of text that are smaller than a document or to query the syntactic roles and relationships within syntactically annotated text sentences, often...
Individual Survival Distributions: A More Effective Tool for Survival Prediction

Fall 2019

Haider, Humza S

An accurate model of a patient’s individual survival distribution can help determine the appropriate treatment for terminal patients. Unfortunately, risk scores (e.g., from Cox Proportional Hazard models) do not provide survival probabilities, single-time probability models (e.g., the Gail model,...
Indoor localization with passive sensors

Spring 2013

Vosoughpour Yazdchi, Meisam

In this thesis, a framework is described that is designed to perform indoor localization in the Smart Condo (TM). A significant aspect of the framework is that it mainly operates on the basis of binary sensors - including motion sensors and occupancy sensors - and it primarily involves geometric...