A classification of code changes and test types dependencies for improving machine learning based test selection

Khaled Al Sabbagh; Miroslaw Staron; Regina Hebig; Francisco Gomes

doi:10.1145/3475960.3475987

Paper in proceeding, 2021

Machine learning is increasingly used to solve software engineering tasks. One example is regression testing, where a classifier is built from historical code commits to predict which test cases require execution. In this paper, we address the problem of how to link specific code commits to test types in order to improve the predictive performance of learning models for regression test selection. We design a taxonomy of dependencies between the content of committed code and the type of a test case. The taxonomy focuses on two types of code commits: changes to memory management and changes to algorithm complexity. To develop the taxonomy, we reviewed the literature, surveyed experienced testers from three Sweden-based software companies, and conducted a workshop. The derived taxonomy shows that memory management code should be tested with tests related to performance, load, soak, stress, volume, and capacity; complexity changes should be tested with the same dedicated tests plus maintainability tests. We conclude that this taxonomy can improve the effectiveness of building learning models for regression testing.
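
The dependencies stated in the abstract can be illustrated as a simple lookup from change category to test types. This is a minimal sketch, not the authors' implementation: the names `DEDICATED_TESTS`, `TAXONOMY`, and `candidate_test_types`, and the category labels `memory_management` and `algorithm_complexity`, are hypothetical and chosen only for illustration. It also assumes that each commit has already been labeled with a change category, a step the abstract does not describe.

```python
from typing import Dict, FrozenSet, Iterable, Set

# Test types the abstract associates with memory management changes.
DEDICATED_TESTS: FrozenSet[str] = frozenset(
    {"performance", "load", "soak", "stress", "volume", "capacity"}
)

# Hypothetical encoding of the taxonomy: change category -> dependent test types.
# Complexity changes map to the same dedicated tests plus maintainability tests.
TAXONOMY: Dict[str, FrozenSet[str]] = {
    "memory_management": DEDICATED_TESTS,
    "algorithm_complexity": DEDICATED_TESTS | {"maintainability"},
}


def candidate_test_types(change_categories: Iterable[str]) -> Set[str]:
    """Return the union of test types linked to the given change categories.

    Categories outside the taxonomy contribute no test types.
    """
    selected: Set[str] = set()
    for category in change_categories:
        selected |= TAXONOMY.get(category, frozenset())
    return selected
```

In a learning-based test selection pipeline, the returned test types could, for instance, serve as an additional feature or as a filter on the candidate tests; how the authors integrate the taxonomy is not stated in this record.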

taxonomy

testing

continuous integration

Author

Khaled Al Sabbagh

Software Engineering 1

University of Gothenburg

Miroslaw Staron

Chalmers, Computer Science and Engineering (Chalmers), Software Engineering (Chalmers)

University of Gothenburg

Regina Hebig

University of Gothenburg

Software Engineering 1

Francisco Gomes

Interaction Design and Software Engineering

University of Gothenburg

SIGPLAN Notices (ACM Special Interest Group on Programming Languages)

15420205 (ISSN)

40-49
9781450386807 (ISBN)

the 17th International Conference on Predictive Models and Data Analytics in Software Engineering
Athens, Greece

Subject Categories (SSIF 2011)

Information Science

Computer Science

Computer Systems

Latest update

5/19/2025
