Search

Skip to Search Results
  • Fall 2016

    Wu, Yifan

    In an online learning problem a player makes decisions in a sequential manner. In each round, the player receives some reward that depends on his action and an outcome generated by the environment while some feedback information about the outcome is revealed. The goal of the player can be...

1 - 1 of 1