This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.
Search
Skip to Search Results- 1Experience replay
- 1Reinforcement learning
- 1Spatial navigation
- 1control variates
- 1fourier
- 1reinforcement learning
-
Spring 2012
Mirian HosseinAbadi, MahdiehSadat
In this thesis we propose a computational model of animal behavior in spatial navigation, based on reinforcement learning ideas. In the field of computer science and specifically artificial intelligence, replay refers to retrieving and reprocessing the experiences that are stored in an abstract...
-
Fall 2018
Temporal-difference (TD) learning is an important approach for predictive knowledge representation and sequential decision making. Within TD learning exists multi-step methods which unify one-step TD learning and Monte Carlo methods in a way where intermediate algorithms can outperform either...