Search
Skip to Search Results
Filter
Subject / Keyword
Supervisors
Author / Creator / Contributor
Year
Collections
Languages
Item type
Departments
-
Fall 2019
Policy evaluation, learning value functions, is an integral part of the reinforcement learning problem. In this thesis, I propose a neural network architecture, the Two-Timescale Network (TTN), for value function approximation which utilizes linear function approximation for the value function...
1 - 1 of 1