- Interrelating Prediction and Control Objectives in Episodic Actor-Critic Reinforcement Learning (Fall 2020)
The reinforcement learning framework provides a simple way to study computational intelligence as the interaction between an agent and an environment. The goal of an agent is to accrue as much reward as possible by intelligently choosing actions given states. This problem of finding a policy that...
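The agent-environment interaction this abstract describes can be sketched as a minimal episodic loop. All names here (`ToyEnv`, the action set) are illustrative assumptions, not from the thesis:

```python
import random

class ToyEnv:
    """A tiny episodic environment: the episode ends when state reaches 5."""
    def __init__(self):
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 1 moves toward the goal, action 0 stays; reward only at the goal.
        self.state += action
        done = self.state >= 5
        reward = 1.0 if done else 0.0
        return self.state, reward, done

random.seed(0)
env = ToyEnv()
state = env.reset()
total_reward, done = 0.0, False
while not done:
    action = random.choice([0, 1])         # a (deliberately poor) policy: pick an action given the state
    state, reward, done = env.step(action)
    total_reward += reward                 # the agent's goal: accrue as much reward as possible
print(total_reward)
```

A real agent would replace `random.choice` with a learned policy; the loop structure itself is unchanged.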
- Fall 2021
Powered by advances in information and Internet technologies, network-based applications have developed rapidly in recent years. Meanwhile, it is recognized that more attention needs to be paid to the issue of cybersecurity. The security of the network environment plays a vital...
- Spring 2024
We introduce the background of the natural language processing field, outlining the benefits and drawbacks of rule-based versus statistical methods. We present knowledge graphs as a way to integrate the explainability of rule-based methods and the power of statistical methods, large language...
- Fall 2023
The average-reward formulation is a natural and important formulation of learning and planning problems, yet it has received much less attention than the episodic and discounted formulations. This dissertation makes contributions in three areas to algorithms and their theory concerning the...
- Spring 2023
Oblique decision trees use linear combinations of features in the decision nodes. Due to the non-smooth structure of decision trees, training oblique decision trees is considerably more difficult, as the parameters are tuned using expensive non-differentiable optimization techniques or found by...
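The distinction this abstract draws can be made concrete: an axis-aligned decision node tests a single feature against a threshold, while an oblique node tests a linear combination of all features. A minimal sketch, with weights chosen by hand for illustration (not learned as in the thesis):

```python
import numpy as np

def axis_aligned_split(x, feature=0, threshold=0.5):
    """Classic decision node: test one feature against a threshold."""
    return x[feature] > threshold

def oblique_split(x, w, b):
    """Oblique decision node: test a linear combination w.x + b of all features."""
    return float(np.dot(w, x) + b) > 0.0

x = np.array([0.2, 0.9])
# The axis-aligned node sees only feature 0, so x goes left...
left = axis_aligned_split(x)                               # False: 0.2 <= 0.5
# ...while the oblique node weighs both features together.
right = oblique_split(x, w=np.array([1.0, 1.0]), b=-1.0)   # True: 0.2 + 0.9 - 1.0 > 0
print(left, right)
```

The non-smoothness mentioned in the abstract comes from the hard `> 0` threshold: a small change in `w` can flip the routing of a point, so the tree's loss is not differentiable in the node parameters.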
- Learning What to Remember: Strategies for Selective External Memory in Online Reinforcement Learning Agents (Spring 2019)
In realistic environments, intelligent agents must learn to integrate information from their past to inform present decisions. An agent's immediate observations are often limited, and some degree of memory is necessary to complete many everyday tasks. However, an agent cannot remember everything...
- Fall 2023
Of all the capabilities of natural intelligence, one of the most exceptional is the ability to expand upon and refine knowledge of the world through subjective experience. Therefore, a longstanding goal of Artificial Intelligence has been to replicate this success: to enable artificial agents to...
- Fall 2023
Partial observability---when the senses lack enough detail to make an optimal decision---is the reality of any decision-making agent acting in the real world. While an agent could be made to make do with its available senses, taking advantage of the history of senses can provide more context and...
- 2011
Technical report TR11-04. A world model is very important for model-based reinforcement learning. For example, a model is used in two ways in Dyna: in learning steps, to select actions, and in planning steps, to project sampled states or features. In this paper we propose least-squares Dyna (LS-Dyna)...
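The Dyna pattern this abstract refers to---learn from real experience, record it in a model, then plan by replaying sampled model transitions---can be sketched as tabular Dyna-Q. This is the standard formulation, not the least-squares variant the report proposes; the two-action state space and step sizes are illustrative assumptions:

```python
import random

ACTIONS = (0, 1)  # assumed two-action problem, for illustration

def dyna_q_update(Q, model, s, a, r, s2, alpha=0.1, gamma=0.9, n_planning=5):
    """One Dyna-Q step: a direct RL update from a real transition,
    followed by n_planning updates from the learned model."""
    # Learning step: Q-learning update from the real transition (s, a, r, s2).
    Q[(s, a)] = Q.get((s, a), 0.0) + alpha * (
        r + gamma * max(Q.get((s2, b), 0.0) for b in ACTIONS) - Q.get((s, a), 0.0))
    # Model learning: remember what this state-action pair led to.
    model[(s, a)] = (r, s2)
    # Planning steps: replay previously seen transitions sampled from the model.
    for _ in range(n_planning):
        (ps, pa), (pr, ps2) = random.choice(list(model.items()))
        Q[(ps, pa)] = Q.get((ps, pa), 0.0) + alpha * (
            pr + gamma * max(Q.get((ps2, b), 0.0) for b in ACTIONS) - Q.get((ps, pa), 0.0))

Q, model = {}, {}
dyna_q_update(Q, model, s=0, a=1, r=1.0, s2=1)
print(Q[(0, 1)])
```

After one real transition and five planning replays of it, `Q[(0, 1)]` has moved most of the way toward its one-step target, which is the point of planning: the model lets a single real experience be reused many times.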