Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Industrial adoption of machine learning techniques for early identification of invalid bug reports

Laiq, Muhammad ; bin Ali, Nauman ; Börstler, Jürgen and Engström, Emelie LU orcid (2024) In Empirical Software Engineering
Abstract (Swedish)
Despite the accuracy of machine learning (ML) techniques in predict-
ing invalid bug reports, as shown in earlier research, and the importance of early
identification of invalid bug reports in software maintenance, the adoption of ML
techniques for this task in industrial practice is yet to be investigated. In this study,
we used a technology transfer model to guide the adoption of an ML technique at a
company for the early identification of invalid bug reports. In the process, we also
identify necessary conditions for adopting such techniques in practice. We followed
a case study research approach with various design and analysis iterations for tech-
nology transfer activities. We collected data from bug... (More)
Despite the accuracy of machine learning (ML) techniques in predict-
ing invalid bug reports, as shown in earlier research, and the importance of early
identification of invalid bug reports in software maintenance, the adoption of ML
techniques for this task in industrial practice is yet to be investigated. In this study,
we used a technology transfer model to guide the adoption of an ML technique at a
company for the early identification of invalid bug reports. In the process, we also
identify necessary conditions for adopting such techniques in practice. We followed
a case study research approach with various design and analysis iterations for tech-
nology transfer activities. We collected data from bug repositories, through focus
groups, a questionnaire, and a presentation and feedback session with an expert.
As expected, we found that an ML technique can identify invalid bug reports with
acceptable accuracy at an early stage. However, the technique’s accuracy drops
over time in its operational use due to changes in the product, the used technolo-
gies, or the development organization. Such changes may require retraining the
ML model. During validation, practitioners highlighted the need to understand
the ML technique’s predictions to trust the predictions. We found that a visual
(using a state-of-the-art ML interpretation framework) and descriptive explana-
tion of the prediction increases the trustability of the technique compared to just
presenting the results of the validity predictions. We conclude that trustability,
integration with the existing toolchain, and maintaining the techniques’ accuracy
over time are critical for increasing the likelihood of adoption. (Less)
Please use this url to cite or link to this publication:
author
; ; and
organization
publishing date
type
Contribution to journal
publication status
in press
subject
in
Empirical Software Engineering
publisher
Springer
ISSN
1573-7616
language
English
LU publication?
yes
id
4ff7db63-79f9-4f62-a34e-269b6a3648ce
date added to LUP
2024-05-21 09:28:50
date last changed
2024-05-21 13:51:16
@article{4ff7db63-79f9-4f62-a34e-269b6a3648ce,
  abstract     = {{Despite the accuracy of machine learning (ML) techniques in predict-<br/>ing invalid bug reports, as shown in earlier research, and the importance of early<br/>identification of invalid bug reports in software maintenance, the adoption of ML<br/>techniques for this task in industrial practice is yet to be investigated. In this study,<br/>we used a technology transfer model to guide the adoption of an ML technique at a<br/>company for the early identification of invalid bug reports. In the process, we also<br/>identify necessary conditions for adopting such techniques in practice. We followed<br/>a case study research approach with various design and analysis iterations for tech-<br/>nology transfer activities. We collected data from bug repositories, through focus<br/>groups, a questionnaire, and a presentation and feedback session with an expert.<br/>As expected, we found that an ML technique can identify invalid bug reports with<br/>acceptable accuracy at an early stage. However, the technique’s accuracy drops<br/>over time in its operational use due to changes in the product, the used technolo-<br/>gies, or the development organization. Such changes may require retraining the<br/>ML model. During validation, practitioners highlighted the need to understand<br/>the ML technique’s predictions to trust the predictions. We found that a visual<br/>(using a state-of-the-art ML interpretation framework) and descriptive explana-<br/>tion of the prediction increases the trustability of the technique compared to just<br/>presenting the results of the validity predictions. We conclude that trustability,<br/>integration with the existing toolchain, and maintaining the techniques’ accuracy<br/>over time are critical for increasing the likelihood of adoption.}},
  author       = {{Laiq, Muhammad and bin Ali, Nauman and Börstler, Jürgen and Engström, Emelie}},
  issn         = {{1573-7616}},
  language     = {{eng}},
  publisher    = {{Springer}},
  series       = {{Empirical Software Engineering}},
  title        = {{Industrial adoption of machine learning techniques for early identification of invalid bug reports}},
  year         = {{2024}},
}