Industrial adoption of machine learning techniques for early identification of invalid bug reports
(2024) In Empirical Software Engineering- Abstract (Swedish)
- Despite the accuracy of machine learning (ML) techniques in predict-
ing invalid bug reports, as shown in earlier research, and the importance of early
identification of invalid bug reports in software maintenance, the adoption of ML
techniques for this task in industrial practice is yet to be investigated. In this study,
we used a technology transfer model to guide the adoption of an ML technique at a
company for the early identification of invalid bug reports. In the process, we also
identify necessary conditions for adopting such techniques in practice. We followed
a case study research approach with various design and analysis iterations for tech-
nology transfer activities. We collected data from bug... (More) - Despite the accuracy of machine learning (ML) techniques in predict-
ing invalid bug reports, as shown in earlier research, and the importance of early
identification of invalid bug reports in software maintenance, the adoption of ML
techniques for this task in industrial practice is yet to be investigated. In this study,
we used a technology transfer model to guide the adoption of an ML technique at a
company for the early identification of invalid bug reports. In the process, we also
identify necessary conditions for adopting such techniques in practice. We followed
a case study research approach with various design and analysis iterations for tech-
nology transfer activities. We collected data from bug repositories, through focus
groups, a questionnaire, and a presentation and feedback session with an expert.
As expected, we found that an ML technique can identify invalid bug reports with
acceptable accuracy at an early stage. However, the technique’s accuracy drops
over time in its operational use due to changes in the product, the used technolo-
gies, or the development organization. Such changes may require retraining the
ML model. During validation, practitioners highlighted the need to understand
the ML technique’s predictions to trust the predictions. We found that a visual
(using a state-of-the-art ML interpretation framework) and descriptive explana-
tion of the prediction increases the trustability of the technique compared to just
presenting the results of the validity predictions. We conclude that trustability,
integration with the existing toolchain, and maintaining the techniques’ accuracy
over time are critical for increasing the likelihood of adoption. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/4ff7db63-79f9-4f62-a34e-269b6a3648ce
- author
- Laiq, Muhammad ; bin Ali, Nauman ; Börstler, Jürgen and Engström, Emelie LU
- organization
- publishing date
- 2024
- type
- Contribution to journal
- publication status
- in press
- subject
- in
- Empirical Software Engineering
- publisher
- Springer
- ISSN
- 1573-7616
- language
- English
- LU publication?
- yes
- id
- 4ff7db63-79f9-4f62-a34e-269b6a3648ce
- date added to LUP
- 2024-05-21 09:28:50
- date last changed
- 2024-05-21 13:51:16
@article{4ff7db63-79f9-4f62-a34e-269b6a3648ce, abstract = {{Despite the accuracy of machine learning (ML) techniques in predict-<br/>ing invalid bug reports, as shown in earlier research, and the importance of early<br/>identification of invalid bug reports in software maintenance, the adoption of ML<br/>techniques for this task in industrial practice is yet to be investigated. In this study,<br/>we used a technology transfer model to guide the adoption of an ML technique at a<br/>company for the early identification of invalid bug reports. In the process, we also<br/>identify necessary conditions for adopting such techniques in practice. We followed<br/>a case study research approach with various design and analysis iterations for tech-<br/>nology transfer activities. We collected data from bug repositories, through focus<br/>groups, a questionnaire, and a presentation and feedback session with an expert.<br/>As expected, we found that an ML technique can identify invalid bug reports with<br/>acceptable accuracy at an early stage. However, the technique’s accuracy drops<br/>over time in its operational use due to changes in the product, the used technolo-<br/>gies, or the development organization. Such changes may require retraining the<br/>ML model. During validation, practitioners highlighted the need to understand<br/>the ML technique’s predictions to trust the predictions. We found that a visual<br/>(using a state-of-the-art ML interpretation framework) and descriptive explana-<br/>tion of the prediction increases the trustability of the technique compared to just<br/>presenting the results of the validity predictions. We conclude that trustability,<br/>integration with the existing toolchain, and maintaining the techniques’ accuracy<br/>over time are critical for increasing the likelihood of adoption.}}, author = {{Laiq, Muhammad and bin Ali, Nauman and Börstler, Jürgen and Engström, Emelie}}, issn = {{1573-7616}}, language = {{eng}}, publisher = {{Springer}}, series = {{Empirical Software Engineering}}, title = {{Industrial adoption of machine learning techniques for early identification of invalid bug reports}}, year = {{2024}}, }