Search

Convergence and No-Regret in Multiagent Learning

2004

Technical report TR04-11. Learning in a multiagent system is a challenging problem due to two key factors. First, if other agents are simultaneously learning then the environment is no longer stationary, thus undermining convergence guarantees. Second, learning is often susceptible to...

An improved fully parallel 3D thinning algorithm

Download

2005

Basu, Anup, Wang, Tao

Technical report TR05-31. A 3D thinning algorithm erodes a 3D image layer by layer to extract the skeletons. This paper presents an improved fully parallel 3D thinning algorithm which extracts medial lines from a 3D image. This algorithm is based on Ma and Sonka's thinning algorithm, which fails...

Automatic Estimation of 3D Transformations using Skeletons for Object Alignment

Download

2005

Wang, Tao, Basu, Anup

Technical report TR05-32. An algorithm for automatic estimation of 3D transformations between two objects is presented in this paper. Skeletons of the 3D objects are created with a fully parallel thinning algorithm and feature point pairs (land markers) are extracted from skeletons automatically,...

Dual Representations for Dynamic Programming and Reinforcement Learning

Download

2006

Wang, Tao, Schuurmans, Dale, Bowling, Michael

Technical report TR06-26. We investigate the dual approach to dynamic programming and reinforcement learning, based on maintaining an explicit representation of stationary distributions as opposed to value functions. A significant advantage of the dual approach is that it allows one to exploit...

Computing Robust Counter-Strategies

Download

2007

Johanson, Michael, Bowling, Michael, Zinkevich, Martin

Technical report TR07-15. Adaptation to other initially unknown agents often requires computing an effective counter-strategy. In the Bayesian paradigm, one must find a good counter-strategy to the inferred posterior of the other agents' behavior. In the experts paradigm, one may want to choose...

Dual Representations for Dynamic Programming

Download

2007

Wang, Tao, Bowling, Michael, Lizotte, Daniel, Schuurmans, Dale

Technical report TR07-10. We propose to use a new dual approach to dynamic programming. The idea is to maintain an explicit representation of stationary distributions as opposed to value functions. A significant advantage of the dual approach is that it allows one to exploit well developed...

Stable Dynamic Programming and Reinforcement Learning with Dual Representations

Download

2007

Wang, Tao, Schuurmans, Dale, Bowling, Michael, Lizotte, Daniel

Technical report TR07-05. We investigate novel, dual algorithms for dynamic programming and reinforcement learning, based on maintaining explicit representations of stationary distributions instead of value functions. In particular, we investigate the convergence properties of standard dynamic...

Regret Minimization in Games with Incomplete Information

Download

2007

Bowling, Michael, Johanson, Michael, Zinkevich, Martin, Piccione, Carmelo

Technical report TR07-14. Extensive games are a powerful model of multiagent decision-making scenarios with incomplete information. Finding a Nash equilibrium for very large instances of these games has received a great deal of recent attention. In this paper, we describe a new technique for...

A method for quantitative measurement of gas volume changes in upper airway

Download

2007

Wang, Tao, Basu, Anup

Technical report TR07-01. A method for quantitative measurement of gas volume changes in upper airway is presented in this paper. The aim of this study is to assess the feasibility of a novel Cone Beam Computerized Tomography (CBCT) technique for quantitative measurement of gas volume in upper...

Fluid vector flow and applications in infant brain MRI analysis

Download

2008

Wang, Tao

Technical report TR08-10. A parametric active contour model based on dynamic boundary vector flow is presented in this paper. The contribution of this model is two-fold. First, it has the largest capture range. Second, it is able to extract concave shape. We apply this method to infant brain...

Items (15)

Collections

Communities

Convergence and No-Regret in Multiagent Learning

An improved fully parallel 3D thinning algorithm

Automatic Estimation of 3D Transformations using Skeletons for Object Alignment

Dual Representations for Dynamic Programming and Reinforcement Learning

Computing Robust Counter-Strategies

Dual Representations for Dynamic Programming

Stable Dynamic Programming and Reinforcement Learning with Dual Representations

Regret Minimization in Games with Incomplete Information

A method for quantitative measurement of gas volume changes in upper airway

Fluid vector flow and applications in infant brain MRI analysis