Search
Skip to Search Results- 1Chen, Haolan
- 1Gertsberg, Vladimir.
- 1Jewison, Timothy
- 1Montague, John J
- 1Munshi, Amr
- 1Normandeau, Christopher M.
-
Spring 2016
While the World Wide Web has always been treated as an immense source of data, most information it provides is usually deemed unstructured and sometimes ambiguous, which in turn makes it unreliable. But the web also contains a relatively large number of structured data in the form of tables,...
-
Fall 2016
Big data applications demand and consequently lead to developments of diverse scalable data management systems, ranging from NoSQL systems to the emerging NewSQL systems. In order to serve thousands of applications and their huge amounts of data, data management systems must be capable of...
-
Spring 2014
Metabolomics involves the high throughput characterization of small molecules or metabolites in cells, tissues and organisms. To interpret, store and exchange metabolomic data it is necessary to have comprehensive, electronically accessible databases that can be used to handle both the...
-
Fall 2015
There is a concern that brain tumours are underreported in the Alberta Cancer Registry (ACR), yet no studies have been performed to investigate this issue. This study addresses the concern of underreporting of brain tumours in the ACR by assessing case ascertainment of primary brain tumours cases...
-
Spring 2020
The web contains a large volume of tables that provide structured information about entities and relationships. This data may be used as a source for exploratory searches and to gather information about desired entities. This thesis focuses on one particular exploratory search where given a query...
-
Spring 2016
Set expansion aims at expanding a given query seed set into a larger and more complete set by adding elements that are likely to belong to the same grouping as the elements of the query set. This thesis studies the problem of efficient set expansion; in particular, given a collection of data...
-
Public Health Applications Using Big Data and Machine Learning Methods: Name- and Location-based Aboriginal Ethnicity Classification and Sentiment Analysis of Breast Cancer Screening in the United States Using Twitter
DownloadFall 2017
Applications using big data and machine learning techniques are transforming how people live in the 21st century, however they are generally underutilized in public health compared to other domains. We proposed and conducted two independent studies to investigate how big data and machine learning...
-
Separating-Plane Factorization Models: Scalable Recommendation from One-class Implicit Feedback
DownloadFall 2016
We study the large-scale video recommendation problem based on user viewing logs instead of explicit ratings. As viewing records are implicitly positive samples, existing matrix factorization methods fail to generate discriminative recommendations based on such one-class data. We propose a...
-
Spring 2016
How can the principles and concepts applied by visual communication designers be used to assist in exploring and understanding the massive, complex volumes of data now available to Digital Humanities researchers? One method we might employ to help us more easily comprehend the implications of...