Search
Skip to Search Results- 22Rafiei, Davood (Computing Science)
- 14Sander, Joerg (Computing Science)
- 2Nascimento, Mario (Computing Science)
- 1Lin, Dekang (Computing Science)
- 1Mou, Lili (Computing Science)
- 1Nascimento, Mario A. (Computing Science)
- 1Barua, Sajib
- 1Chubak, Pirooz
- 1Esteki, Afsaneh
- 1Hasnat, Md Arif
- 1Jullion, Zachary M
- 1Kamalloo, Ehsan
- 3Databases
- 2Clustering
- 2Geotagging
- 2Information Retrieval
- 2Natural Language Processing
- 2Question Answering
-
Fall 2017
Density-based clustering methods extract high density clusters which are separated by regions of lower density. HDBSCAN* is an existing algorithm for producing a density-based cluster hierarchy. To obtain clusters from this hierarchy it includes an instance of FOSC(Framework for Optimal Selection...
-
Fall 2013
Many clustering techniques require parameter settings and depending on an algorithms sensitivity to the parameter, the choice of the parameter value can be very important. Several approaches have been proposed to find the “best” value of the clustering parameter for the different unsupervised...
-
An Interaction-driven Approach for Inferring the Polarity of Collaborations in Wikipedia and Political Preferences on Twitter
DownloadSpring 2013
In this thesis we explore interactions of users of two major information sources, namely Wikipedia and Twitter. In particular, we show that revision histories of Wikipedia articles contain interaction patterns which can be used to build collaboration profiles of editors. Such profiles can be...
-
Fall 2013
The amount of information available on the Web is rapidly growing, and the need for extracting more useful and relevant data from this tremendously large source has become an interesting research challenge. Among various types of useful information that can be extracted, lists in particular are...
-
Spring 2016
While the World Wide Web has always been treated as an immense source of data, most information it provides is usually deemed unstructured and sometimes ambiguous, which in turn makes it unreliable. But the web also contains a relatively large number of structured data in the form of tables,...
-
Fall 2021
The problem of aspect-based recommendation---recommending an "item" to a "recommendation recipient" based on "aspects", i.e., information about the characteristic features of the item that may be of interest to the recommendation recipient or what makes an item a good match for a recommendation...
-
Fall 2014
We study the problem of classifying users in a classified ad network and its applications in further analyzing the network. Specifically, we seek to classify Kijiji users into one of the two business and non-business categories. The problem is challenging due to the sparsity of the data about...
-
Fall 2009
Answer typing is an important aspect of the question answering process. Most commonly addressed with the use of a fixed set of possible answer classes via question classification, answer typing influences which answers will ultimately be selected as correct. Answer typing introduces the concept...
-
Fall 2012
With the large presence of organizations from different sectors of economy on the web, the problem of detecting which sector a given website belongs to is both important and challenging. We study the problem of classifying websites into four non-topical categories: public, private, non-profit and...
-
Fall 2018
Identifying the peptide sequence from a mass spectrum is done either by database search or De novo peptide sequencing. This thesis focuses on identification of peptides by using database search, which is a process where an MS/MS spectrum is searched against an entire database of spectra...