- Spring 2013
In a discrete-time online control problem, a learner tries to control the state of an initially unknown environment so as to minimize the sum of the losses it suffers, where the losses are assumed to depend on the individual state transitions. Various models of control problems have...
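The setting described in this abstract can be sketched as a simple interaction loop. Everything concrete here — the random-walk dynamics, the target state, the loss function, and the naive controller — is an illustrative assumption, not a detail from the thesis; the point is only the shape of the protocol: act, observe a transition of unknown dynamics, pay a transition-dependent loss.

```python
import random

random.seed(0)

def unknown_transition(s, a):
    # Placeholder for the initially unknown dynamics: a noisy walk on {0,...,9}.
    return max(0, min(9, s + a + random.choice([-1, 0, 1])))

def loss(s, a, s2):
    # Transition-dependent loss (assumed): distance from state 5 plus a small
    # penalty on the control effort.
    return abs(s2 - 5) + 0.1 * abs(a)

s, total_loss = 0, 0.0
for t in range(100):
    # A naive fixed controller: push toward state 5. A learner in this setting
    # would adapt this rule from the observed transitions and losses.
    a = 1 if s < 5 else (-1 if s > 5 else 0)
    s2 = unknown_transition(s, a)
    total_loss += loss(s, a, s2)  # the cumulative quantity being minimized
    s = s2

print(round(total_loss, 1))
```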
- Fall 2023
Many real-world tasks in fields such as robotics and control can be formulated as constrained Markov decision processes (CMDPs). In a CMDP, the objective is usually to optimize the return while ensuring that certain constraints are satisfied. The primal-dual approach is a common...
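The primal-dual approach mentioned in this abstract can be illustrated with a minimal sketch. The toy single-state CMDP, the action rewards and costs, the budget `d`, and the step size `eta` are all assumptions made for illustration, not details from the thesis: the primal step best-responds to the current Lagrangian, and the dual step adjusts the multiplier toward constraint satisfaction.

```python
# Toy CMDP with one state and two actions (illustrative values): maximize
# expected reward subject to the constraint E[cost] <= d.
rewards = {"risky": 1.0, "safe": 0.3}
costs = {"risky": 1.0, "safe": 0.0}
d = 0.5     # cost budget (assumption)
lam = 0.0   # dual variable (Lagrange multiplier)
eta = 0.05  # dual step size

counts = {"risky": 0, "safe": 0}
T = 5000
for t in range(T):
    # Primal step: best response to the current Lagrangian r(a) - lam * c(a).
    a = max(rewards, key=lambda a: rewards[a] - lam * costs[a])
    counts[a] += 1
    # Dual step: raise lam when the constraint is violated, lower it otherwise,
    # projecting back onto lam >= 0.
    lam = max(0.0, lam + eta * (costs[a] - d))

avg_cost = counts["risky"] / T  # empirical average cost of the play
print(f"lambda={lam:.2f}, average cost={avg_cost:.2f}")  # → lambda=0.70, average cost=0.50
```

The multiplier settles near the value that makes the two actions equally attractive, so the time-averaged play mixes them and the average cost lands on the budget — the standard behavior of primal-dual schemes on CMDPs.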
- Fall 2011
This thesis studies reinforcement learning and planning problems modeled by a discounted Markov Decision Process (MDP) with a large state space and a finite action space. We follow the value-based approach, in which a function approximator is used to estimate the optimal value function....
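The value-based approach described in this abstract can be sketched as fitted Q-iteration on a toy chain MDP: Bellman backup targets are repeatedly regressed onto state features, one weight vector per action. The five-state chain, the one-hot features (the simplest linear approximator), and the least-squares fit are illustrative assumptions, not the thesis's actual method.

```python
import numpy as np

n_states, gamma = 5, 0.9

def step(s, a):
    # Deterministic chain: action 1 moves right, action 0 moves left;
    # reward 1.0 for landing on (or staying in) the last state.
    s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
    r = 1.0 if s2 == n_states - 1 else 0.0
    return s2, r

# One-hot state features: a linear function approximator that happens to be
# exact on this tiny problem.
X = np.eye(n_states)

# Fitted Q-iteration: regress Bellman targets onto the features, keeping one
# weight vector per action.
w = np.zeros((2, n_states))
for _ in range(200):
    targets = np.zeros((2, n_states))
    for s in range(n_states):
        for a in (0, 1):
            s2, r = step(s, a)
            targets[a, s] = r + gamma * max(X[s2] @ w[0], X[s2] @ w[1])
    for a in (0, 1):
        w[a], *_ = np.linalg.lstsq(X, targets[a], rcond=None)

# Greedy policy w.r.t. the fitted Q-function: 1 = move right, toward the reward.
greedy = [int(X[s] @ w[1] > X[s] @ w[0]) for s in range(n_states)]
print(greedy)  # → [1, 1, 1, 1, 1]
```

With richer feature maps the regression step introduces approximation error at every iteration, which is exactly the error-propagation question such value-based analyses study.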