Search

Skip to Search Results
  • Fall 2011

    Maei, Hamid Reza

    We present a new family of gradient temporal-difference (TD) learning methods with function approximation whose complexity, both in terms of memory and per-time-step computation, scales linearly with the number of learning parameters. TD methods are powerful prediction techniques, and with...

1 - 1 of 1