Author / Creator / Contributor

Sina Ghiassian

Subject / Keyword

Off-policy learning
Online learning
Prediction learning
Step-size Ratchet
Step-size adaptation
Temporal difference learning with regularized corrections

Collections

Graduate and Postdoctoral Studies (GPS), Faculty of
Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations

Languages

English

Item type

Thesis

Departments

Department of Computing Science

Supervisors

Sutton, Richard (Computing Science)
White, Adam (Computing Science)

Online Off-policy Prediction
Spring 2022

Sina Ghiassian

In this dissertation, we study online off-policy temporal-difference learning algorithms, a class of reinforcement learning algorithms that can learn predictions in an efficient and scalable manner. The contributions of this dissertation are of two kinds: (1) empirically studying existing...
