Search

Filter

Departments

25Department of Computing Science

Item type

25Thesis

Languages

25English

Collections

Supervisors

1Zaiane, Osmar (Computing Science)

Show 1 more ...

Author / Creator / Contributor

Show 4 more ...

Subject / Keyword

Show 4 more ...

Year

Geometric Filter: A Space and Time Efficient Lookup Table with Bounded Error
Download

Fall 2009

Zhao, Yang

Lookup tables are frequently used in many applications to store and retrieve keyvalue pairs. Designing efficient lookup tables can be challenging with constraints placed on storage, query response time and/or result accuracy. This thesis proposes Geometric filter, a lookup table with a space...
Class-free answer typing
Download

Fall 2009

Pinchak, Christopher

Answer typing is an important aspect of the question answering process. Most commonly addressed with the use of a fixed set of possible answer classes via question classification, answer typing influences which answers will ultimately be selected as correct. Answer typing introduces the concept...
Indexing and Querying Natural Language Text
Download

Spring 2012

Chubak, Pirooz

Natural language text is a prominent source of representing and communicating information and knowledge. It is often desirable to search in granularities of text that are smaller than a document or to query the syntactic roles and relationships within syntactically annotated text sentences, often...
Classifying Websites into Non-topical Categories
Download

Fall 2012

Thapa, Chaman

With the large presence of organizations from different sectors of economy on the web, the problem of detecting which sector a given website belongs to is both important and challenging. We study the problem of classifying websites into four non-topical categories: public, private, non-profit and...
An Interaction-driven Approach for Inferring the Polarity of Collaborations in Wikipedia and Political Preferences on Twitter
Download

Spring 2013

Makazhanov, Aibek

In this thesis we explore interactions of users of two major information sources, namely Wikipedia and Twitter. In particular, we show that revision histories of Wikipedia articles contain interaction patterns which can be used to build collaboration profiles of editors. Such profiles can be...
Analyzing And Extracting Lists On The Web
Download

Fall 2013

Esteki, Afsaneh

The amount of information available on the Web is rapidly growing, and the need for extracting more useful and relevant data from this tremendously large source has become an interesting research challenge. Among various types of useful information that can be extracted, lists in particular are...
Characterizing Users in a Classified Ad Network
Download

Fall 2014

Waqar, Muhammad

We study the problem of classifying users in a classified ad network and its applications in further analyzing the network. Specifically, we seek to classify Kijiji users into one of the two business and non-business categories. The problem is challenging due to the sparsity of the data about...
Geotagging Named Entities in Web Pages
Download

Fall 2014

Yu, Jiangwei

We study the problem of geotagging named entities where the goal is to identify the most relevant location of a named entity based on the content of the Web pages where the entity is mentioned. We hypothesize the relationship between the mentions of an entity and its geo-center in web pages, and...
Offline Strategies for Online Set Expansion
Download

Spring 2016

Zhou, Kai

Set expansion aims at expanding a given query seed set into a larger and more complete set by adding elements that are likely to belong to the same grouping as the elements of the query set. This thesis studies the problem of efficient set expansion; in particular, given a collection of data...
Annotating Web Tables Using Surface Text Patterns
Download

Spring 2016

Wang, Andong

While the World Wide Web has always been treated as an immense source of data, most information it provides is usually deemed unstructured and sometimes ambiguous, which in turn makes it unreliable. But the web also contains a relatively large number of structured data in the form of tables,...