Download the full-sized PDF of A Contextual Approach towards More Accurate Duplicate Bug Report DetectionDownload the full-sized PDF



Permanent link (DOI):


Export to: EndNote  |  Zotero  |  Mendeley


This file is in the following communities:

Graduate Studies and Research, Faculty of


This file is in the following collections:

Theses and Dissertations

A Contextual Approach towards More Accurate Duplicate Bug Report Detection Open Access


Other title
Software context
Issue-tracking systems
Duplicate bug reports
Bug report triaging
Type of item
Degree grantor
University of Alberta
Author or creator
Alipour, Anahita
Supervisor and department
Stroulia, Eleni (Computing Science)
Hindle, Abram (Computing Science)
Examining committee member and department
Miller, James (Department of Electrical and Computer Engineering)
Barbosa, Denilson (Computing Science)
Department of Computing Science

Date accepted
Graduation date
Master of Science
Degree level
The issue-tracking systems used by software projects contain issues or bugs written by a wide variety of bug reporters, with different levels of knowledge about the system under development. Typically, reporters lack the skills and/or time to search the issue-tracking system for similar issues already reported. Hence, many reports end up referring to the same issue, which effectively makes the bug-report triaging process time consuming and error prone. Many researchers have approached the bug-deduplication problem using off-the-shelf information-retrieval tools. In this thesis, we extend the state of the art by investigating how contextual information about software-quality attributes, software-architecture terms, and system-development topics can be exploited to improve bug-deduplication. We demonstrate the effectiveness of our contextual bug-deduplication method on the bug repository of Android, Eclipse, Mozilla, and OpenOffice Software Systems. Based on this experience, we conclude that researchers should not ignore the context of the software engineering domain for deduplication.
Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of this thesis and to lend or sell such copies for private, scholarly or scientific research purposes only. Where the thesis is converted to, or otherwise made available in digital form, the University of Alberta will advise potential users of the thesis of these terms. The author reserves all other publication and other rights in association with the copyright in the thesis and, except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or otherwise reproduced in any material form whatsoever without the author's prior written permission.
Citation for previous publication
Alipour, Anahita; Hindle, Abram; Stroulia, Eleni. (2013). A Contextual Approach towards More Accurate Duplicate Bug Report Detection.

File Details

Date Uploaded
Date Modified
Audit Status
Audits have not yet been run on this file.
File format: pdf (Portable Document Format)
Mime type: application/pdf
File size: 1328390
Last modified: 2015:10:12 13:20:01-06:00
Filename: Alipour_Anahita_Fall 2013.pdf
Original checksum: 6a64a512a830fab95d4b740931b9d5e9
Well formed: false
Valid: false
Status message: No document catalog dictionary offset=0
Activity of users you follow
User Activity Date