Parameter Screening for Curious Reinforcement Learner Motivated by Unexpected Error

Ady, Nadia M.

doi:doi:10.7939/R3G15TS0P

This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

View

Download

Communities and Collections

Computing Science, Department of / Technical Reports (Computing Science)

Usage

254 views
190 downloads

Parameter Screening for Curious Reinforcement Learner Motivated by Unexpected Error

Author(s) / Creator(s)
- Ady, Nadia M.
Curiosity is a critical component of intelligence. One method of motivating curious behaviour in computational systems is to use reinforcement learning to learn which decisions maximize the amount of unexpected error observed by a predictive component. However, reinforcement learning algorithms for prediction and control require the system designer to set multiple parameters, and it is unknown how such a curious system’s behaviour might vary depending on parameter settings. Eight parameters (one learning rate, continuation probability, trace decay parameter for both prediction and control, 'epsilon' (the probability of a random action for epsilon-greedy control) and beta-naught parameter for computation of White’s (2015) unexpected error) were tested in an inscribed central composite experimental design. The response variable was the return. We found that the linear effects on return for epsilon, the learning rate for control, the continuation probability for prediction, and the beta-naught parameter for unexpected error were significant, along with the quadratic interactions between epsilon and beta-naught, epsilon and the continuation probability for prediction, beta-naught and the continuation probability for prediction, and the learning rate and continuation probability for prediction.
Date created

2017-04-10
Subjects / Keywords
Type of Item

Report
DOI

https://doi.org/10.7939/R3G15TS0P
License

Attribution-NonCommercial-NoDerivatives 4.0 International

Language
- English
Additional contributors
- Pilarski, Patrick M.