Search

Skip to Search Results
  • Fall 2022

    Lo, Chunlok

    This thesis investigates a new approach to model-based reinforcement learning using background planning: mixing (approximate) dynamic programming updates and model-free updates, similar to the Dyna architecture. Background planning with learned models is often worse than model-free alternatives,...

1 - 1 of 1