
Adaptive Representation for Policy Gradient

Spring 2015

Das Gupta, Ujjwal

Much of the focus on finding good representations in reinforcement learning has been on learning complex non-linear predictors of value. Methods like policy gradient, that do not learn a value function and instead directly represent policy, often need fewer parameters to learn good policies....

Investigating Two Policy Gradient Methods Under Different Time Discretizations

Fall 2021

Farrahi, Homayoon

Continuous-time reinforcement learning tasks commonly use discrete time steps of fixed cycle times for actions. Choosing a small action-cycle time in such tasks allows reinforcement learning agents fast reaction and a more temporally detailed perception of the environment. The learning...

Policy Gradient Reinforcement Learning Without Regret

Spring 2015

Dick, Travis B

This thesis consists of two independent projects, each contributing to a central goal of artificial intelligence research: to build computer systems that are capable of performing tasks and solving problems without problem-specific direction from us, their designers. I focus on two formal...
