Temporal Abstraction in Monte Carlo Tree Search

  • Author / Creator
    Vafadost, Mostafa
  • Given nothing but the generative model of the environment, Monte Carlo Tree Search techniques have recently shown spectacular results on domains previously thought to be intractable. In this thesis we try to develop generic techniques for temporal abstraction inside MCTS that would allow the effective construction of medium/long term plans in arbitrary Atari 2600 games. We introduce two methods: 1- CTW-UCT, that uses CTW algorithm to extract information from expert trajectories in order to make use of this information in search. 2- Variable Time Scale (VTS), that finds the desired time scale for taking each action online. We found that, CTW-UCT did not achieve satisfactory results. The VTS algorithm, however, was shown to be a promising algorithm. Although VTS did not achieve the highest individual scores on any game, it performed close to the best on most of the games. In other words, compared to different UCT algorithms that make use of macro actions with repeated actions of pre-specified lengths, VTS achieved comparable results to the best score on most of the games. We conclude therefore that, for domains where we do not know the desired time scale, VTS can be a good option.

  • Subjects / Keywords
  • Graduation date
    Fall 2013
  • Type of Item
  • Degree
    Master of Science
  • DOI
  • License
    This thesis is made available by the University of Alberta Libraries with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.