
Unifying n-Step Temporal-Difference Action-Value Methods

Spring 2019

Juan Fernando Hernandez Garcia

Unifying seemingly disparate algorithmic ideas to produce better performing algorithms has been a longstanding goal in reinforcement learning. As a primary example, the TD(λ) algorithm elegantly unifies temporal difference (TD) methods with Monte Carlo methods through the use of eligibility...

Using Regret Estimation to Solve Games Compactly

Spring 2016

Morrill, Dustin R

Game theoretic solution concepts, such as Nash equilibrium strategies that are optimal against worst case opponents, provide guidance in finding desirable autonomous agent behaviour. In particular, we wish to approximate solutions to complex, dynamic tasks, such as negotiation or bidding in...
