Do Take it Personal: It's Not What You Say, It's Who (and Where) You Are!
(2016) In Tiny Transactions on Computer Science 4.- Abstract
- Issue management in market-driven software projects is constantly under time pressure. A limited set of developers must share their time between developing features for the next release and resolving reported issues. Project managers need to find the appropriate balance between a high quality product and fast time to market. We study a telecom company in Sweden developing embedded systems for a consumer market. The project managers report that developers resolve approximately 10% of the issues reported during a project. Consequently, it is critical to properly prioritize the issues to receive the best possible return on investment, and above all to remove all bugs that might impact the market's reception of the product. We use machine... (More)
- Issue management in market-driven software projects is constantly under time pressure. A limited set of developers must share their time between developing features for the next release and resolving reported issues. Project managers need to find the appropriate balance between a high quality product and fast time to market. We study a telecom company in Sweden developing embedded systems for a consumer market. The project managers report that developers resolve approximately 10% of the issues reported during a project. Consequently, it is critical to properly prioritize the issues to receive the best possible return on investment, and above all to remove all bugs that might impact the market's reception of the product. We use machine learning to investigate what features of an issue report are the best predictors of changes to production code during its corresponding resolution. After removing all features jeopardizing the confidentiality of individual engineers, the issue reports are characterized by 19 features (apart from text). We extract 80,000 issue reports, an equal mix of positive and negative examples, and train a Bayesian Network classifier [2], obtaining 73% classification accuracy. Moreover, it reveals that the feature with the highest predictive value is from which physical site the issue was submitted. The general priority feature however, is only ranked 17 out of 19, whereas the submitting team is ranked 12. Our findings confirm a suspicion in the company: the priority set by the issue submitter is indeed a poor predictor of a future code change. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/8837852
- author
- Borg, Markus LU ; Gulin, Petter and Olofsson, Linus
- organization
- publishing date
- 2016
- type
- Contribution to journal
- publication status
- published
- subject
- in
- Tiny Transactions on Computer Science
- volume
- 4
- project
- Embedded Applications Software Engineering
- language
- English
- LU publication?
- yes
- id
- 42ce73db-4039-42f7-b7fc-dc635e51c8b8 (old id 8837852)
- alternative location
- http://tinytocs.ece.utexas.edu/papers/tinytocs4_paper_borg.pdf
- date added to LUP
- 2016-04-04 14:01:59
- date last changed
- 2021-04-29 09:44:29
@article{42ce73db-4039-42f7-b7fc-dc635e51c8b8, abstract = {{Issue management in market-driven software projects is constantly under time pressure. A limited set of developers must share their time between developing features for the next release and resolving reported issues. Project managers need to find the appropriate balance between a high quality product and fast time to market. We study a telecom company in Sweden developing embedded systems for a consumer market. The project managers report that developers resolve approximately 10% of the issues reported during a project. Consequently, it is critical to properly prioritize the issues to receive the best possible return on investment, and above all to remove all bugs that might impact the market's reception of the product. We use machine learning to investigate what features of an issue report are the best predictors of changes to production code during its corresponding resolution. After removing all features jeopardizing the confidentiality of individual engineers, the issue reports are characterized by 19 features (apart from text). We extract 80,000 issue reports, an equal mix of positive and negative examples, and train a Bayesian Network classifier [2], obtaining 73% classification accuracy. Moreover, it reveals that the feature with the highest predictive value is from which physical site the issue was submitted. The general priority feature however, is only ranked 17 out of 19, whereas the submitting team is ranked 12. Our findings confirm a suspicion in the company: the priority set by the issue submitter is indeed a poor predictor of a future code change.}}, author = {{Borg, Markus and Gulin, Petter and Olofsson, Linus}}, language = {{eng}}, series = {{Tiny Transactions on Computer Science}}, title = {{Do Take it Personal: It's Not What You Say, It's Who (and Where) You Are!}}, url = {{https://lup.lub.lu.se/search/files/6264356/8837853.pdf}}, volume = {{4}}, year = {{2016}}, }