Search
Skip to Search Results- 14Exploration
- 3Reinforcement Learning
- 2Artificial Intelligence
- 1Africa
- 1Atari 2600
- 1Atlantic Ocean Islands
- 1Birdsell, J.M.
- 1Bixby, Rebecca J.
- 1Boston, Penelope J.
- 1Boutin, S.
- 1Cholodovskis Machado, Marlos
- 1Cummings, G.G.
- 6Graduate and Postdoctoral Studies (GPS), Faculty of
- 6Graduate and Postdoctoral Studies (GPS), Faculty of/Theses and Dissertations
- 2Toolkit for Grant Success
- 2Toolkit for Grant Success/Educational Materials (Toolkit for Grant Success)
- 1Helmholtz-Alberta Initiative
- 1Helmholtz-Alberta Initiative/Journal Articles & Research Abstracts (Helmholtz-Alberta Initiative)
-
"Un Remedde Contre Toutes Maladies": Travel Writing and the Scurvy Incident in Cartier's Second Voyage
Download2012-01-01
This is the accepted version of the following article: True, Micah. “Un Remedde Contre Toutes Maladies”: Travel Writing and the Scurvy Incident in Cartier’s Second Voyage.” Quebec Studies, vol. 54, no. 1, 2012, pp. 3–16., which has been published in final form at https://doi.org/...
-
2021 New Frontiers in Research Fund - Exploration Competition Webinar Slides (English/French)
Download2021-08-01
This presentation was presented by NFRF for the 2021 Exploration Competition.
-
2013
Sturtevant, Nathan R., Valenzano, Richard, Schaeffer, Jonathan
While greedy best-first search (GBFS) is a popular algorithm for solving automated planning tasks, it can exhibit poor performance if the heuristic in use mistakenly identifies a region of the search space as promising. In such cases, the way the algorithm greedily trusts the heuristic can cause...
-
Spring 2020
Reinforcement Learning is a formalism for learning by trial and error. Unfortunately, trial and error can take a long time to find a solution if the agent does not efficiently explore the behaviours available to it. Moreover, how an agent ought to explore depends on the task that the agent is...
-
2009
Estabrooks, C.A., Norton, P.G., Cummings, G.G., Squires, J.E., Birdsell, J.M.
Background: The context of healthcare organizations such as hospitals is increasingly accepted as having the potential to influence the use of new knowledge. However, the mechanisms by which the organizational context influences evidence-based practices are not well understood. Current measures...
-
Spring 2019
In the reinforcement learning (RL) problem an agent must learn how to act optimally through trial-and-error interactions with a complex, unknown, stochastic environment. The actions taken by the agent influence not just the immediate reward it observes but also the future states and rewards it...
-
Improving Deep Deterministic Policy Gradient for Sparse Reward and Goal-Conditioned Continuous Control
DownloadSpring 2024
We propose an improved version of deep deterministic policy gradient (DDPG) for sparse reward and goal-conditioned reinforcement learning. To enhance exploration, we introduce \emph{${\epsilon}{t}$-greedy}, which uses search to generate exploratory options, focusing on less-visited states. We...
-
1994
Movement and settlement patterns of animal offspring, along with the costs of occupying familiar and unfamiliar habitats, have been inferred frequently, but rarely have they been documented directly. To obtain such information, we monitored the individual fates of 205 (94%) of the 219 offspring...
-
2021-06-16
Audio recording of the NFRF-Exploration 2021 Roundtable with Evaluators. In this session, eight of the multidisciplinary review panel members who participated in the NFRF Exploration application review share their insights and tips. The evaluators talk about how reviewers assess applications and...