Search

Skip to Search Results
  • Spring 2022

    Sina Ghiassian

    In this dissertation, we study online off-policy temporal-difference learning algorithms, a class of reinforcement learning algorithms that can learn predictions in an efficient and scalable manner. The contributions of this dissertation are one of the two kinds: (1) empirically studying existing...

1 - 1 of 1