  • Spring 2021

    Ni, Jingjiao

    Emphatic-Temporal-Difference (Emphatic-TD) learning algorithms were recently proposed, building on Temporal-Difference (TD) methods, the most central and widely used class of reinforcement learning algorithms. Emphatic-TD learning algorithms were originally designed to solve the divergence problem of...
