Search
Skip to Search Results
Filter
Author / Creator / Contributor
Collections
Supervisors
Subject / Keyword
Year
Languages
Item type
Departments
-
Fall 2019
Policy evaluation, learning value functions, is an integral part of the reinforcement learning problem. In this thesis, I propose a neural network architecture, the Two-Timescale Network (TTN), for value function approximation which utilizes linear function approximation for the value function...
1 - 1 of 1