SearchSkip to Search Results
- 1Chubak, Pirooz
- 1Esteki, Afsaneh
- 1Hasnat, Md Arif
- 1Kassenov, Zharkyn
- 1Makazhanov, Aibek
- 1Pinchak, Christopher
- 1Active Learning
- 1Answer Typing
- 1Artificial Intelligence
- 1Attribute Selection
An Interaction-driven Approach for Inferring the Polarity of Collaborations in Wikipedia and Political Preferences on TwitterDownload
In this thesis we explore interactions of users of two major information sources, namely Wikipedia and Twitter. In particular, we show that revision histories of Wikipedia articles contain interaction patterns which can be used to build collaboration profiles of editors. Such profiles can be...
The amount of information available on the Web is rapidly growing, and the need for extracting more useful and relevant data from this tremendously large source has become an interesting research challenge. Among various types of useful information that can be extracted, lists in particular are...
While the World Wide Web has always been treated as an immense source of data, most information it provides is usually deemed unstructured and sometimes ambiguous, which in turn makes it unreliable. But the web also contains a relatively large number of structured data in the form of tables,...
We study the problem of classifying users in a classified ad network and its applications in further analyzing the network. Specifically, we seek to classify Kijiji users into one of the two business and non-business categories. The problem is challenging due to the sparsity of the data about...
Answer typing is an important aspect of the question answering process. Most commonly addressed with the use of a fixed set of possible answer classes via question classification, answer typing influences which answers will ultimately be selected as correct. Answer typing introduces the concept...
With the large presence of organizations from different sectors of economy on the web, the problem of detecting which sector a given website belongs to is both important and challenging. We study the problem of classifying websites into four non-topical categories: public, private, non-profit and...
Edge-labeled graphs are widely used to describe relationships between entities in a database. We study a class of queries on edge-labeled graphs, referred to as exemplar queries, where each query gives an example of what the user is searching for. Given an exemplar query, we study the problem of...
The web contains a large volume of tables that provide structured information about entities and relationships. This data may be used as a source for exploratory searches and to gather information about desired entities. This thesis focuses on one particular exploratory search where given a query...
Lookup tables are frequently used in many applications to store and retrieve keyvalue pairs. Designing efficient lookup tables can be challenging with constraints placed on storage, query response time and/or result accuracy. This thesis proposes Geometric filter, a lookup table with a space...
We study the problem of geotagging named entities where the goal is to identify the most relevant location of a named entity based on the content of the Web pages where the entity is mentioned. We hypothesize the relationship between the mentions of an entity and its geo-center in web pages, and...