Multiple-Choice Question Answering Over Semi-Structured Tables

  • Author / Creator
    Ni, Weite
  • Question answering (QA) is the task of automatically finding answers to natural language questions. A QA system requires access to some form of knowledge in order to find the answers. Most QA tasks use raw text corpora or structured knowledge bases as knowledge. However, raw text corpora, although easy to get in large quantities, are hard to reason with by machines. Structured knowledge bases are easy to reason with, but require manual effort to normalize. We view semi-structured tables as a compromise between raw text corpora and structured knowledge bases. Semi-structured tables require less manual effort to build comparing with structured knowledge bases, and their structured properties make it easy for automated reasoning.

    In this thesis, we build a QA system that can answer multiple-choice questions based on semi-structured tables. We tackle the task in two steps: table retrieval and answer selection. To retrieve the most relevant table to the questions, we build a feature-based model that can effectively take the candidate choices into account. To find the best answer based on the retrieved table, we first measure the relevance between the question and rows in the table, then extract the best answer from the most relevant rows. Evaluation on the TabMCQ benchmark shows that our system achieves a huge improvement over the previous state-of-the-art system.

  • Subjects / Keywords
  • Graduation date
    Fall 2019
  • Type of Item
  • Degree
    Master of Science
  • DOI
  • License
    Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of this thesis and to lend or sell such copies for private, scholarly or scientific research purposes only. Where the thesis is converted to, or otherwise made available in digital form, the University of Alberta will advise potential users of the thesis of these terms. The author reserves all other publication and other rights in association with the copyright in the thesis and, except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or otherwise reproduced in any material form whatsoever without the author's prior written permission.