A Data Cleaning Framework for Trajectory Clustering

  • Author / Creator
    Idrissov, Agzam Y.
  • Recent proliferation of low-cost and lightweight GPS tracking devices led to a large increase in the amounts of collected mobility data. The rapidly emerging field of location-based services requires accurate and informative knowledge mining from these large quantities of data. One such mobility knowledge mining task is trajectory clustering, where one tries to find paths that have been travelled frequently. Most existing trajectory clustering techniques do not discuss cleaning the data before applying a clustering algorithm. Since “noisy” data can have a significant effect on the clustering process, preprocessing such trajectory data will likely improve trajectory clustering results. In this thesis, we present a trajectory data cleaning framework, which consists of four steps: Outlier Detection, Stop Detection, Interpolation and Map Matching. We evaluate our framework using popular clustering algorithms and distance functions, and show that our proposed preprocessing (cleaning) framework indeed does improve the quality of obtained clusters.

  • Subjects / Keywords
  • Graduation date
    Fall 2012
  • Type of Item
  • Degree
    Master of Science
  • DOI
  • License
    This thesis is made available by the University of Alberta Libraries with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.
  • Language
  • Institution
    University of Alberta
  • Degree level
  • Department
  • Supervisor / co-supervisor and their department(s)
  • Examining committee members and their departments
    • Nascimento, Mario A. (Computing Science)
    • Yuan, Li-Yan (Computing Science)
    • Barbosa, Denilson (Computing Science) - Chair
    • Yang, Herb (Computing Science)