

Subject / Keyword

Approximate Value/Policy Iteration
Error Propagation
Machine Learning
Model Selection
Regularization
Regularized Fitted Q-Iteration
Regularized LSTD
Regularized Least-Squares Regression
Regularized Policy Iteration
Reinforcement Learning


Item type

Thesis

Supervisors

Jagersand, Martin (Computing Science)
Szepesvari, Csaba (Computing Science)

Author / Creator / Contributor

Farahmand, Amir-massoud

Collections

Graduate and Postdoctoral Studies (GPS), Faculty of
Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations

Languages

English

Departments

Department of Computing Science

Regularization in reinforcement learning
Fall 2011

Farahmand, Amir-massoud

This thesis studies the reinforcement learning and planning problems that are modeled by a discounted Markov Decision Process (MDP) with a large state space and finite action space. We follow the value-based approach in which a function approximator is used to estimate the optimal value function....
