Search
Skip to Search Results- 1Jeya Veeraiah, Vivek Veeriah
- 1Mahmood, Ashique
- 1Naik, Abhishek
- 1Ni, Jingjiao
- 1Sherstan, Craig
- 1Shibhansh Dohare
- 2Reinforcement learning
- 2reinforcement learning
- 1Artificial intelligence
- 1Collaborative control
- 1Electromyography
- 1External Memory
-
Fall 2010
Performance and stability of many iterative algorithms such as stochastic gradient descent largely depend on a fixed and scalar step-size parameter. Use of a fixed and scalar step-size value may lead to limited performance in many problems. We study several existing step-size adaptation...
-
Spring 2018
Facial expressions and other body language are important for human commu- nication. They complement speech and make the process of communication simple and sustainable. However, the process of communication using existing approaches to human-machine interaction is not intuitive as that of human...
-
Learning What to Remember: Strategies for Selective External Memory in Online Reinforcement Learning Agents
DownloadSpring 2019
In realistic environments, intelligent agents must learn to integrate information from their past to inform present decisions. An agent's immediate observations are often limited, and some degree of memory is necessary to complete many everyday tasks. However, an agent cannot remember everything...
-
Spring 2024
In this dissertation, I investigate how we can exploit generic problem structure to make reinforcement learning algorithms more efficient. Generic problem structure means basic structure that exists in a wide range of problems (e.g., an action taken in the present does not influence the past), as...
-
Spring 2024
This dissertation develops simple and practical learning algorithms from first principles for long-lived agents. Formally, the algorithms are developed within the reinforcement learning framework for continuing (non-episodic) problems, in which the agent-environment interaction goes on ad...
-
Fall 2020
We explore the interplay of generate-and-test and gradient-descent techniques for solving online supervised learning problems. The task in supervised learning is to learn a function using samples of inputs to output pairs. This function is called the target function. The standard way to learn...
-
Spring 2021
Emphatic-Temporal-Difference (Emphatic-TD) learning algorithms were recently proposed based on the most central and widely used reinforcement learning algorithms, Temporal-Difference (TD) methods. Emphatic-TD learning algorithms were originally designed to solve the divergence problem of...
-
Fall 2015
The control of powered prosthetic arms has been researched for over 50 years, yet prosthetic control remains an open problem, not just from a research perspective, but from a clinical perspective as well. Significant advances have been made in the manufacture of highly functional prosthetic...