Search
Skip to Search Results- 22Rafiei, Davood (Computing Science)
- 4Mou, Lili (Computing Science)
- 2Sander, Joerg (Computing Science)
- 1Barbosa, Denilson (Computing Science)
- 1Lin, Dekang (Computing Science)
- 1Nascimento, Mario (Computing Science)
-
Fall 2009
Lookup tables are frequently used in many applications to store and retrieve keyvalue pairs. Designing efficient lookup tables can be challenging with constraints placed on storage, query response time and/or result accuracy. This thesis proposes Geometric filter, a lookup table with a space...
-
Fall 2009
Answer typing is an important aspect of the question answering process. Most commonly addressed with the use of a fixed set of possible answer classes via question classification, answer typing influences which answers will ultimately be selected as correct. Answer typing introduces the concept...
-
Spring 2012
Natural language text is a prominent source of representing and communicating information and knowledge. It is often desirable to search in granularities of text that are smaller than a document or to query the syntactic roles and relationships within syntactically annotated text sentences, often...
-
Fall 2012
With the large presence of organizations from different sectors of economy on the web, the problem of detecting which sector a given website belongs to is both important and challenging. We study the problem of classifying websites into four non-topical categories: public, private, non-profit and...
-
An Interaction-driven Approach for Inferring the Polarity of Collaborations in Wikipedia and Political Preferences on Twitter
DownloadSpring 2013
In this thesis we explore interactions of users of two major information sources, namely Wikipedia and Twitter. In particular, we show that revision histories of Wikipedia articles contain interaction patterns which can be used to build collaboration profiles of editors. Such profiles can be...
-
Fall 2013
The amount of information available on the Web is rapidly growing, and the need for extracting more useful and relevant data from this tremendously large source has become an interesting research challenge. Among various types of useful information that can be extracted, lists in particular are...
-
Fall 2014
We study the problem of classifying users in a classified ad network and its applications in further analyzing the network. Specifically, we seek to classify Kijiji users into one of the two business and non-business categories. The problem is challenging due to the sparsity of the data about...
-
Fall 2014
We study the problem of geotagging named entities where the goal is to identify the most relevant location of a named entity based on the content of the Web pages where the entity is mentioned. We hypothesize the relationship between the mentions of an entity and its geo-center in web pages, and...
-
Spring 2016
Set expansion aims at expanding a given query seed set into a larger and more complete set by adding elements that are likely to belong to the same grouping as the elements of the query set. This thesis studies the problem of efficient set expansion; in particular, given a collection of data...
-
Spring 2016
While the World Wide Web has always been treated as an immense source of data, most information it provides is usually deemed unstructured and sometimes ambiguous, which in turn makes it unreliable. But the web also contains a relatively large number of structured data in the form of tables,...