Search
Skip to Search Results- 19Planning
- 6Abstractions
- 5Artificial Intelligence
- 5Heuristic Search
- 5Reinforcement Learning
- 2Game Theory
- 1Asadi Atui, Kavosh
- 1Barriga Richards, Nicolas A
- 1Brown, Jennifer A.
- 1Faid, Julian TW
- 1Fan, Gaojian
- 1Hawkin, John A
-
Social Actor Engagement in Municipal Decision-Making for Parks, Planning, and Civil Society in Edmonton, Alberta, Canada 1960-2010: Institutional Intersections
DownloadSpring 2020
Edmonton, Alberta, has a unique approach to public spaces that sees conjoined creation and development sharing of public spaces for the collective benefit of the community and stakeholders; this approach began 100 years ago. Green or open spaces, natural areas, the river valley, City of Edmonton...
-
Strengths, Weaknesses, and Combinations of Model-based and Model-free Reinforcement Learning
DownloadSpring 2016
Reinforcement learning algorithms are conventionally divided into two approaches: a model-based approach that builds a model of the environment and then computes a value function from the model, and a model-free approach that directly estimates the value function. The first contribution of this...
-
Spring 2023
AlphaZero is a self-play reinforcement learning algorithm that achieves superhuman play in the games of chess, shogi, and Go via policy iteration. To be an effective policy improvement operator, AlphaZero’s search needs to have accurate value estimates for the states that appear in its search...
-
Fall 2019
In this thesis, we study merge-and-shrink (M&S), a flexible abstraction technique for generating heuristics for cost optimal planning. We first propose three novel merging strategies for M&S, namely, Undirected Min-Cut (UMC), Maximum Intermediate Abstraction Size Minimizing (MIASM), and Dynamic...
-
Spring 2016
Game theoretic solution concepts, such as Nash equilibrium strategies that are optimal against worst case opponents, provide guidance in finding desirable autonomous agent behaviour. In particular, we wish to approximate solutions to complex, dynamic tasks, such as negotiation or bidding in...