Search
Skip to Search Results
Filter
Author / Creator / Contributor
Subject / Keyword
- 2Reinforcement Learning
- 1Artificial Intelligence
- 1Decision-Making
- 1Dyna
- 1Epsilon Greedy Policy
- 1Grid World
Year
Collections
Item type
Departments
-
Chasing Hallucinated Value: A Pitfall of Dyna Style Algorithms with Imperfect Environment Models
DownloadSpring 2020
In Dyna style algorithms, reinforcement learning (RL) agents use a model of the environment to generate simulated experience. By updating on this simulated experience, Dyna style algorithms allow agents to potentially learn control policies in fewer environment interactions than agents that use...
1 - 2 of 2