This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Author / Creator / Contributor

1Bennett, Brendan

Subject / Keyword

1artificial intelligence
1machine learning
1reinforcement learning
1temporal difference methods
1variance estimation
1variance of returns

Year

Collections

1Graduate and Postdoctoral Studies (GPS), Faculty of
1Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations

Languages

1English

Item type

1Thesis

Departments

1Department of Computing Science

Supervisors

1Sutton, Richard (Computing Science)

Estimating Variance of Returns using Temporal Difference Methods
Download

Spring 2021

Bennett, Brendan

Temporal difference (TD) methods provide a powerful means of learning to make predictions in an online, model-free, and highly scalable manner. In the reinforcement learning (RL) framework, we formalize these prediction targets in terms of a (possibly discounted) sum of rewards, called the...

1 - 1 of 1

Search

Items (1)

Collections

Communities

Estimating Variance of Returns using Temporal Difference Methods