Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

A data-driven approach for understanding invalid bug reports: An industrial case study

Laiq, Muhammad ; Ali, Nauman Bin ; Börstler, Jürgen and Engström, Emelie LU orcid (2023) In Information and Software Technology 164.
Abstract
Context:
Bug reports created during software development and maintenance do not always describe deviations from a system’s valid behavior. Such invalid bug reports may consume significant resources and adversely affect the prioritization and resolution of valid bug reports. There is a need to identify preventive actions to reduce the inflow of invalid bug reports. Existing research has shown that manually analyzing invalid bug report descriptions provides cues regarding preventive actions. However, such a manual approach is not cost-effective due to the time required to analyze a sufficiently large number of bug reports needed to identify useful patterns. Furthermore, the analysis needs to be repeated as the underlying causes of... (More)
Context:
Bug reports created during software development and maintenance do not always describe deviations from a system’s valid behavior. Such invalid bug reports may consume significant resources and adversely affect the prioritization and resolution of valid bug reports. There is a need to identify preventive actions to reduce the inflow of invalid bug reports. Existing research has shown that manually analyzing invalid bug report descriptions provides cues regarding preventive actions. However, such a manual approach is not cost-effective due to the time required to analyze a sufficiently large number of bug reports needed to identify useful patterns. Furthermore, the analysis needs to be repeated as the underlying causes of invalid bug reports change over time.

Objective:
In this study, we propose and evaluate the use of Latent Dirichlet Allocation (LDA), a topic modeling approach, to support practitioners in suggesting preventive actions to avoid the creation of similar invalid bug reports in the future.

Method:
In an industrial case study, we first manually analyzed descriptions of invalid bug reports to identify common patterns in their descriptions. We further investigated to what extent LDA can support this manual process. We used expert-based validation to evaluate the relevance of identified common patterns and their usefulness in suggesting preventive measures.

Results:
We found that invalid bug reports have common patterns that are perceived as relevant, and they can be used to devise preventive measures. Furthermore, the identification of common patterns can be supported with automation.

Conclusion:
Using LDA, practitioners can effectively identify representative groups of bug reports (i.e., relevant common patterns) from a large number of bug reports and analyze them further to devise preventive measures. (Less)
Please use this url to cite or link to this publication:
author
; ; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
in
Information and Software Technology
volume
164
article number
107305
publisher
Elsevier
external identifiers
  • scopus:85166970380
ISSN
0950-5849
DOI
10.1016/j.infsof.2023.107305
language
English
LU publication?
yes
id
c041f362-0196-4d61-ad22-6ccad38d1c8f
date added to LUP
2023-09-22 19:25:10
date last changed
2023-11-21 22:32:43
@article{c041f362-0196-4d61-ad22-6ccad38d1c8f,
  abstract     = {{Context:<br/>Bug reports created during software development and maintenance do not always describe deviations from a system’s valid behavior. Such invalid bug reports may consume significant resources and adversely affect the prioritization and resolution of valid bug reports. There is a need to identify preventive actions to reduce the inflow of invalid bug reports. Existing research has shown that manually analyzing invalid bug report descriptions provides cues regarding preventive actions. However, such a manual approach is not cost-effective due to the time required to analyze a sufficiently large number of bug reports needed to identify useful patterns. Furthermore, the analysis needs to be repeated as the underlying causes of invalid bug reports change over time.<br/><br/>Objective:<br/>In this study, we propose and evaluate the use of Latent Dirichlet Allocation (LDA), a topic modeling approach, to support practitioners in suggesting preventive actions to avoid the creation of similar invalid bug reports in the future.<br/><br/>Method:<br/>In an industrial case study, we first manually analyzed descriptions of invalid bug reports to identify common patterns in their descriptions. We further investigated to what extent LDA can support this manual process. We used expert-based validation to evaluate the relevance of identified common patterns and their usefulness in suggesting preventive measures.<br/><br/>Results:<br/>We found that invalid bug reports have common patterns that are perceived as relevant, and they can be used to devise preventive measures. Furthermore, the identification of common patterns can be supported with automation.<br/><br/>Conclusion:<br/>Using LDA, practitioners can effectively identify representative groups of bug reports (i.e., relevant common patterns) from a large number of bug reports and analyze them further to devise preventive measures.}},
  author       = {{Laiq, Muhammad and Ali, Nauman Bin and Börstler, Jürgen and Engström, Emelie}},
  issn         = {{0950-5849}},
  language     = {{eng}},
  month        = {{12}},
  publisher    = {{Elsevier}},
  series       = {{Information and Software Technology}},
  title        = {{A data-driven approach for understanding invalid bug reports: An industrial case study}},
  url          = {{http://dx.doi.org/10.1016/j.infsof.2023.107305}},
  doi          = {{10.1016/j.infsof.2023.107305}},
  volume       = {{164}},
  year         = {{2023}},
}