This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.
Search
Skip to Search Results
Filter
Subject / Keyword
- 1Approximate Value/Policy Iteration
- 1Error Propagation
- 1Machine Learning
- 1Model Selection
- 1Regularization
- 1Regularized Fitted Q-Iteration
Item type
Author / Creator / Contributor
Year
Collections
Languages
Departments
-
Fall 2011
This thesis studies the reinforcement learning and planning problems that are modeled by a discounted Markov Decision Process (MDP) with a large state space and finite action space. We follow the value-based approach in which a function approximator is used to estimate the optimal value function....
1 - 1 of 1