ERA

Download the full-sized PDF of Question Answering for BiomedicineDownload the full-sized PDF

Analytics

Share

Permanent link (DOI): https://doi.org/10.7939/R3WH2DK7T

Download

Export to: EndNote  |  Zotero  |  Mendeley

Communities

This file is in the following communities:

Graduate Studies and Research, Faculty of

Collections

This file is in the following collections:

Theses and Dissertations

Question Answering for Biomedicine Open Access

Descriptions

Other title
Subject/Keyword
Question Answering
Bioinformatics
Natural Language Processing
Machine Learning
Artificial Intelligence
Type of item
Thesis
Degree grantor
University of Alberta
Author or creator
Liu, Yifeng
Supervisor and department
Wishart, David (Computing Science)
Examining committee member and department
Wishart, David (Computing Science)
Zaiane, Osmar (Computing Science)
Greiner, Russell (Computing Science)
Gallin, Warren (Biological Science)
Pavlidis, Paul (University of British Columbia)
Department
Department of Computing Science
Specialization

Date accepted
2016-09-27T17:01:00Z
Graduation date
2016-06:Fall 2016
Degree
Doctor of Philosophy
Degree level
Doctoral
Abstract
The field of biomedicine is reeling from “information overload”. Indeed, biomedical researchers find it almost impossible to stay current with published literature due to the vast amounts of data being generated and published. As a result, they are turning to text mining. Over the past two decades the field of biomedical text mining has experienced significant advances, such as the development of high quality biomedical knowledge bases and ontologies, the construction of biomedical search engines and the development of biomedical relationship mining tools. However, users still have to manually examine the retrieved documents and connect snippets of information from various databases to find answers to their queries. Ideally what is needed is a “wise” question answering (QA) system. With the advances in QA systems, including the triumph of IBM Watson on Jeopardy!, many biomedical researchers, including myself, believe that now is the time to further advance biomedical text mining by developing a biomedical question answering system. Such a system would be able to answer questions regarding biomedical entities and help researchers better digest existing knowledge and formulate new hypothesis. The task of biomedical question answering is faced with two central challenges: 1) retrieving relevant information from heterogeneous data sources (structured databases and free-text collections), and 2) formulating natural language answers from retrieved concepts and snippets. My research focuses on developing an association mining tool (PolySearch2) and a web-based biomedical question answering system (BioQA), that would provide precise answers with encyclopedia-like commentary to a wide range of biomedical questions. In particular, PolySearch2 mines concept associations from free-text collections based on co-occurrence statistics. BioQA uses PolySearch2 and other tools to decode natural language questions and formulate natural language answers for both descriptive and associative queries. Both PolySearch2 and BioQA offer public web interface to answer questions posed by biomedical researchers, physicians, students and the inquisitive public. PolySearch2 and BioQA represent an integrated solution to the core challenges in biomedical question answering.
Language
English
DOI
doi:10.7939/R3WH2DK7T
Rights
This thesis is made available by the University of Alberta Libraries with permission of the copyright owner solely for the purpose of private, scholarly or scientific research. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.
Citation for previous publication
Liu, Y., Liang, Y., Wishart, D. (2015) PolySearch2: a significantly improved text-mining system for discovering associations between human diseases, genes, drugs, metabolites, toxins and more. Nucleic Acids Research. Jul 1;43(W1):W535-42.

File Details

Date Uploaded
Date Modified
2016-09-27T23:01:01.227+00:00
Audit Status
Audits have not yet been run on this file.
Characterization
File format: pdf (PDF/A)
Mime type: application/pdf
File size: 7480814
Last modified: 2016:11:16 14:04:57-07:00
Filename: Liu_Yifeng_201609_PhD.pdf
Original checksum: 45db23dfa94de03808c1c07ecb4e09fe
Activity of users you follow
User Activity Date