Search
Skip to Search Results
Filter
Subject / Keyword
Supervisors
Author / Creator / Contributor
Year
Collections
Languages
Item type
Departments
-
Fall 2019
Q-learning can be difficult to use in continuous action spaces, because a difficult optimization has to be solved to find the maximal action. Some common strategies have been to discretize the action space, solve the maximization with a powerful optimizer at each step, restrict the functional...
1 - 1 of 1