This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Author / Creator / Contributor

1Patterson, Andrew

Subject / Keyword

1Machine Learning
1Reinforcement Learning

Year

Collections

1Graduate and Postdoctoral Studies (GPS), Faculty of
1Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations

Languages

1English

Item type

1Thesis

Departments

1Department of Computing Science

Supervisors

1White, Martha (Computing Science)

Improving the reliability of reinforcement learning algorithms through biconjugate Bellman errors
Download

Spring 2024

Patterson, Andrew

In this thesis, we seek to improve the reliability of reinforcement learning algorithms for nonlinear function approximation. Semi-gradient temporal difference (TD) update rules form the basis of most state-of-the-art value function learning systems despite clear counterexamples proving their...

1 - 1 of 1

Search

Items (1)

Collections

Communities

Improving the reliability of reinforcement learning algorithms through biconjugate Bellman errors