Search
Skip to Search Results- 3Computer Games
- 3Reinforcement Learning
- 2Computer Vision and Multimedia Communications
- 2Extensive games
- 2Game theory
- 2Online learning
-
2005
Technical report TR05-31. A 3D thinning algorithm erodes a 3D image layer by layer to extract the skeletons. This paper presents an improved fully parallel 3D thinning algorithm which extracts medial lines from a 3D image. This algorithm is based on Ma and Sonka's thinning algorithm, which fails...
-
2005
Technical report TR05-32. An algorithm for automatic estimation of 3D transformations between two objects is presented in this paper. Skeletons of the 3D objects are created with a fully parallel thinning algorithm and feature point pairs (land markers) are extracted from skeletons automatically,...
-
2006
Wang, Tao, Schuurmans, Dale, Bowling, Michael
Technical report TR06-26. We investigate the dual approach to dynamic programming and reinforcement learning, based on maintaining an explicit representation of stationary distributions as opposed to value functions. A significant advantage of the dual approach is that it allows one to exploit...
-
2007
Johanson, Michael, Bowling, Michael, Zinkevich, Martin
Technical report TR07-15. Adaptation to other initially unknown agents often requires computing an effective counter-strategy. In the Bayesian paradigm, one must find a good counter-strategy to the inferred posterior of the other agents' behavior. In the experts paradigm, one may want to choose...
-
2007
Wang, Tao, Bowling, Michael, Lizotte, Daniel, Schuurmans, Dale
Technical report TR07-10. We propose to use a new dual approach to dynamic programming. The idea is to maintain an explicit representation of stationary distributions as opposed to value functions. A significant advantage of the dual approach is that it allows one to exploit well developed...
-
2007
Wang, Tao, Schuurmans, Dale, Bowling, Michael, Lizotte, Daniel
Technical report TR07-05. We investigate novel, dual algorithms for dynamic programming and reinforcement learning, based on maintaining explicit representations of stationary distributions instead of value functions. In particular, we investigate the convergence properties of standard dynamic...
-
2007
Bowling, Michael, Johanson, Michael, Zinkevich, Martin, Piccione, Carmelo
Technical report TR07-14. Extensive games are a powerful model of multiagent decision-making scenarios with incomplete information. Finding a Nash equilibrium for very large instances of these games has received a great deal of recent attention. In this paper, we describe a new technique for...
-
2007
Technical report TR07-01. A method for quantitative measurement of gas volume changes in upper airway is presented in this paper. The aim of this study is to assess the feasibility of a novel Cone Beam Computerized Tomography (CBCT) technique for quantitative measurement of gas volume in upper...
-
2008
Technical report TR08-10. A parametric active contour model based on dynamic boundary vector flow is presented in this paper. The contribution of this model is two-fold. First, it has the largest capture range. Second, it is able to extract concave shape. We apply this method to infant brain...