This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Author / Creator / Contributor

1Farahmand, Amir-massoud

Subject / Keyword

1Approximate Value/Policy Iteration
1Error Propagation
1Machine Learning
1Model Selection
1Regularization
1Regularized Fitted Q-Iteration

1Regularized LSTD
1Regularized Least-Squares Regression
1Regularized Policy Iteration
1Reinforcement Learning

Show 4 more ...

Year

Collections

1Graduate and Postdoctoral Studies (GPS), Faculty of
1Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations

Languages

1English

Item type

1Thesis

Departments

1Department of Computing Science

Supervisors

1Jagersand, Martin (Computing Science)
1Szepesvari, Csaba (Computing Science)

Regularization in reinforcement learning
Download

Fall 2011

Farahmand, Amir-massoud

This thesis studies the reinforcement learning and planning problems that are modeled by a discounted Markov Decision Process (MDP) with a large state space and finite action space. We follow the value-based approach in which a function approximator is used to estimate the optimal value function....

1 - 1 of 1

Search

Items (1)

Collections

Communities

Regularization in reinforcement learning