This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

Search

Filter

Author / Creator / Contributor

1Joulani, Pooria

Subject / Keyword

1Delayed Feedback
1Multi-Armed Bandit
1Online Learning

Departments

1Department of Computing Science

Year

Collections

1Graduate and Postdoctoral Studies (GPS), Faculty of
1Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations

Languages

1English

Item type

1Thesis

Supervisors

1Szepesvari, Csaba (Computing Science)

Multi-Armed Bandit Problems under Delayed Feedback
Download

Fall 2012

Joulani, Pooria

In this thesis, the multi-armed bandit (MAB) problem in online learning is studied, when the feedback information is not observed immediately but rather after arbitrary, unknown, random delays. In the stochastic" setting when the rewards come from a fixed distribution, an algorithm is given that...

1 - 1 of 1

Search

Items (1)

Collections

Communities

Multi-Armed Bandit Problems under Delayed Feedback