Search
Skip to Search Results
Filter
Subject / Keyword
- 2Approximate dynamic programming
- 2Function approximation
- 2Temporal difference learning
- 2Two-timescale stochastic approximation
- 1Active learning
- 1Actor-critic methods
Languages
Author / Creator / Contributor
Year
Collections
-
Computationally effective optimization methods for complex process control and scheduling problems
DownloadFall 2011
Over the years, how to reduce the operational cost, raise the profit and enhance the operational safety attracts tremendous interests in the chemical and petroleum industry. Since the regulatory control strategy may not achieve such rigorous requirements, higher level process control activities,...
-
2009
Bhatnagar, Shalabh, Sutton, Richard, Ghavamzadeh, Mohammad, Lee, Mark
Technical report TR09-10. We present four new reinforcement learning algorithms based on actor-critic, function approximation, and natural gradient ideas, and we provide their convergence proofs. Actor-critic reinforcement learning methods are online approximations to policy iteration in which...
1 - 3 of 3