ERA

Download the full-sized PDF of Characterizing Users in a Classified Ad NetworkDownload the full-sized PDF

Analytics

Share

Permanent link (DOI): https://doi.org/10.7939/R3028PK9T

Download

Export to: EndNote  |  Zotero  |  Mendeley

Communities

This file is in the following communities:

Graduate Studies and Research, Faculty of

Collections

This file is in the following collections:

Theses and Dissertations

Characterizing Users in a Classified Ad Network Open Access

Descriptions

Other title
Subject/Keyword
Social Network Analysis
User Modeling
Classified Ad Network
Type of item
Thesis
Degree grantor
University of Alberta
Author or creator
Waqar, Muhammad
Supervisor and department
Rafiei, Davood (Computing Science)
Examining committee member and department
Moore, Sarah (Business)
Rafiei, Davood (Computing Science)
Barbosa, Denilson (Computing Science)
Wong, Kenny (Computing Science)
Department
Department of Computing Science
Specialization

Date accepted
2014-08-26T15:50:32Z
Graduation date
2014-11
Degree
Master of Science
Degree level
Master's
Abstract
We study the problem of classifying users in a classified ad network and its applications in further analyzing the network. Specifically, we seek to classify Kijiji users into one of the two business and non-business categories. The problem is challenging due to the sparsity of the data about users, the vague separation of the two classes, and the highly imbalanced distribution of users between the classes. Our work utilizes the ad content to build a set of distinctive terms for each class (profile). Given the statistics on how an ad mentions terms from a class profile, the affinity of an ad (and subsequently a user) to a particular class is determined. Our experiments reveal that this is an effective strategy for classifying users, outperforming various baselines. We study the impact of profile size on the classification task and observe that using longer class profiles may not be helpful. Moreover, in the absence of labeled training data, we show that a simple bootstrapping technique with only a few n-grams as a seed set can give nearly good results in terms of F-measure. We also study the same problem from a different angle: collective behavior of a user in posting ads. Using features associated with such behavior, we identify four distinct usage patterns for the users of the Kijiji network and study the association of business and non-business users with these patterns. Our experiments reveal that a sizeable number of members from both user groups validly manifest all the patterns, due to which the aforementioned features are inadequate for the classification task. Finally, using the results of user classification, we analyze the Kijiji network from various aspects. Our results, for example, indicate that businesses are more amenable to post consistently in a particular set of categories than non-business users and that the popularity of different categories for both the user groups exhibits various seasonal trends.
Language
English
DOI
doi:10.7939/R3028PK9T
Rights
Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of this thesis and to lend or sell such copies for private, scholarly or scientific research purposes only. Where the thesis is converted to, or otherwise made available in digital form, the University of Alberta will advise potential users of the thesis of these terms. The author reserves all other publication and other rights in association with the copyright in the thesis and, except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or otherwise reproduced in any material form whatsoever without the author's prior written permission.
Citation for previous publication

File Details

Date Uploaded
Date Modified
2015-01-08T08:02:26.070+00:00
Audit Status
Audits have not yet been run on this file.
Characterization
File format: pdf (PDF/A)
Mime type: application/pdf
File size: 1172200
Last modified: 2015:10:18 02:07:57-06:00
Filename: Waqar_Muhammad_201408_MS.pdf
Original checksum: 80a05d01a33748c69aec68a0957d114d
Activity of users you follow
User Activity Date