Agent Tamer The Secret Life Of Algorithms

Irene Olayinka; Calarina Muslimani; Dr. Matthew Taylor

doi:doi:10.7939/r3-n78h-g120

This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

View

Download

Communities and Collections

WISEST Summer Research Program / WISEST Research Posters

Usage

166 views
197 downloads

Agent Tamer The Secret Life Of Algorithms

Author(s) / Creator(s)
Although this report deals with the mechanisms of artificially intelligent
rather than intelligence agents, the former is no less a subject of
fascination. My research centred around an algorithm called Training an
Agent Manually via Evaluative Reinforcement (TAMER), which
incorporates human feedback into a reinforcement learning model. I ran
several trials in the Mountain Car environment provided by the OpenAI
gym library, altering the uniform value, credit assignment value, and
budget of each to see which changes returned the best performance for
the agent. Ultimately, lower credit assignment values and uniform
values that are slightly better than those an average human trainer can
provide are most effective in improving the performance of the agent,
while the budget does not have a significant effect on the agent's
efficiency.
Date created

2021-08-01
Subjects / Keywords
Type of Item

Conference/Workshop Poster
DOI

https://doi.org/10.7939/r3-n78h-g120
License

Attribution-NonCommercial 4.0 International

Language
- English