Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers

Ishan Mani Subedi; Maninder Singh; Vijayalakshmi Ramasamy; Gursimran Singh Walia

doi:10.1109/ISSREW53611.2021.00111

Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers

Ishan Mani Subedi, Maninder Singh, Vijayalakshmi Ramasamy, Gursimran Singh Walia

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

Agile is one of the most widely used software development methodologies that include user stories, the smallest units semi-structured specifications to capture the requirements from a user's point of view. Despite being popular, only a little research has been done to automate the quality checking/analysis of a user story before assigning it to a sprint. In this study, we have chosen two metrics, i.e., Testable and Valuable criteria from INVEST checklist, and have applied supervised machine learning classifiers to automatically classify them. Since the industrial data collected for the research was unbalanced, we also applied data balancing techniques such as SMOTE, RUS, ROS, and Back translation (BT) to verify if they improved any classification metrics. Although we did not see any significant improvements in accuracy and precision for the classifiers after applying data balancing techniques, we noticed a significant improvement in recall values across all the classifiers. Our research provides some promising insights into how this research could be used in the software industry to automate the analysis of user stories and improve the quality of software produced.

Original language	English (US)
Title of host publication	Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	409-414
Number of pages	6
ISBN (Electronic)	9781665426039
DOIs	https://doi.org/10.1109/ISSREW53611.2021.00111
State	Published - 2021
Externally published	Yes
Event	32nd IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021 - Wuhan, China Duration: Oct 25 2021 → Oct 28 2021

Publication series

Name	Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021

Conference

Conference	32nd IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021
Country/Territory	China
City	Wuhan
Period	10/25/21 → 10/28/21

Keywords

Machine learning
Requirement Engineering and Quality
Text Augmentation
User Stories

ASJC Scopus subject areas

Software
Safety, Risk, Reliability and Quality

Access to Document

10.1109/ISSREW53611.2021.00111

Cite this

Subedi, I. M., Singh, M., Ramasamy, V., & Walia, G. S. (2021). Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers. In Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021 (pp. 409-414). (Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ISSREW53611.2021.00111

Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers. / Subedi, Ishan Mani; Singh, Maninder; Ramasamy, Vijayalakshmi et al.
Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021. Institute of Electrical and Electronics Engineers Inc., 2021. p. 409-414 (Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Subedi, IM, Singh, M, Ramasamy, V & Walia, GS 2021, Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers. in Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021. Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021, Institute of Electrical and Electronics Engineers Inc., pp. 409-414, 32nd IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021, Wuhan, China, 10/25/21. https://doi.org/10.1109/ISSREW53611.2021.00111

Subedi IM, Singh M, Ramasamy V, Walia GS. Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers. In Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021. Institute of Electrical and Electronics Engineers Inc. 2021. p. 409-414. (Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021). doi: 10.1109/ISSREW53611.2021.00111

Subedi, Ishan Mani ; Singh, Maninder ; Ramasamy, Vijayalakshmi et al. / Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers. Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021. Institute of Electrical and Electronics Engineers Inc., 2021. pp. 409-414 (Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021).

@inproceedings{a44bee89152847d0aca358146c54e2c9,

title = "Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers",

abstract = "Agile is one of the most widely used software development methodologies that include user stories, the smallest units semi-structured specifications to capture the requirements from a user's point of view. Despite being popular, only a little research has been done to automate the quality checking/analysis of a user story before assigning it to a sprint. In this study, we have chosen two metrics, i.e., Testable and Valuable criteria from INVEST checklist, and have applied supervised machine learning classifiers to automatically classify them. Since the industrial data collected for the research was unbalanced, we also applied data balancing techniques such as SMOTE, RUS, ROS, and Back translation (BT) to verify if they improved any classification metrics. Although we did not see any significant improvements in accuracy and precision for the classifiers after applying data balancing techniques, we noticed a significant improvement in recall values across all the classifiers. Our research provides some promising insights into how this research could be used in the software industry to automate the analysis of user stories and improve the quality of software produced.",

keywords = "Machine learning, Requirement Engineering and Quality, Text Augmentation, User Stories",

author = "Subedi, {Ishan Mani} and Maninder Singh and Vijayalakshmi Ramasamy and Walia, {Gursimran Singh}",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 32nd IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021 ; Conference date: 25-10-2021 Through 28-10-2021",

year = "2021",

doi = "10.1109/ISSREW53611.2021.00111",

language = "English (US)",

series = "Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "409--414",

booktitle = "Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021",

}

TY - GEN

T1 - Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers

AU - Subedi, Ishan Mani

AU - Singh, Maninder

AU - Ramasamy, Vijayalakshmi

AU - Walia, Gursimran Singh

PY - 2021

Y1 - 2021

N2 - Agile is one of the most widely used software development methodologies that include user stories, the smallest units semi-structured specifications to capture the requirements from a user's point of view. Despite being popular, only a little research has been done to automate the quality checking/analysis of a user story before assigning it to a sprint. In this study, we have chosen two metrics, i.e., Testable and Valuable criteria from INVEST checklist, and have applied supervised machine learning classifiers to automatically classify them. Since the industrial data collected for the research was unbalanced, we also applied data balancing techniques such as SMOTE, RUS, ROS, and Back translation (BT) to verify if they improved any classification metrics. Although we did not see any significant improvements in accuracy and precision for the classifiers after applying data balancing techniques, we noticed a significant improvement in recall values across all the classifiers. Our research provides some promising insights into how this research could be used in the software industry to automate the analysis of user stories and improve the quality of software produced.

AB - Agile is one of the most widely used software development methodologies that include user stories, the smallest units semi-structured specifications to capture the requirements from a user's point of view. Despite being popular, only a little research has been done to automate the quality checking/analysis of a user story before assigning it to a sprint. In this study, we have chosen two metrics, i.e., Testable and Valuable criteria from INVEST checklist, and have applied supervised machine learning classifiers to automatically classify them. Since the industrial data collected for the research was unbalanced, we also applied data balancing techniques such as SMOTE, RUS, ROS, and Back translation (BT) to verify if they improved any classification metrics. Although we did not see any significant improvements in accuracy and precision for the classifiers after applying data balancing techniques, we noticed a significant improvement in recall values across all the classifiers. Our research provides some promising insights into how this research could be used in the software industry to automate the analysis of user stories and improve the quality of software produced.

KW - Machine learning

KW - Requirement Engineering and Quality

KW - Text Augmentation

KW - User Stories

UR - http://www.scopus.com/inward/record.url?scp=85126956890&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85126956890&partnerID=8YFLogxK

U2 - 10.1109/ISSREW53611.2021.00111

DO - 10.1109/ISSREW53611.2021.00111

M3 - Conference contribution

AN - SCOPUS:85126956890

T3 - Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021

SP - 409

EP - 414

BT - Proceedings - 2021 IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 32nd IEEE International Symposium on Software Reliability Engineering Workshops, ISSREW 2021

Y2 - 25 October 2021 through 28 October 2021

ER -

Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this