Search
Skip to Search Results
Filter
Author / Creator / Contributor
Subject / Keyword
Year
Collections
Languages
Item type
Departments
Supervisors
-
Spring 2023
AlphaZero is a self-play reinforcement learning algorithm that achieves superhuman play in the games of chess, shogi, and Go via policy iteration. To be an effective policy improvement operator, AlphaZero’s search needs to have accurate value estimates for the states that appear in its search...
1 - 1 of 1