Comparing Input Prioritization Techniques for Testing Deep Learning Algorithms

Vasilii Mosin; Miroslaw Staron; Darko Durisic; Francisco Gomes; Sushant Kumar Pandey; Ashok Chaitanya Koppisetty

doi:10.1109/SEAA56994.2022.00020

Comparing Input Prioritization Techniques for Testing Deep Learning Algorithms
Paper i proceeding, 2022

Deep learning (DL) systems are becoming an essential part of software systems, so it is necessary to test them thoroughly. This is a challenging task since the test sets can grow over time as the new data is being acquired, and it becomes time-consuming. Input prioritization is necessary to reduce the testing time since prioritized test inputs are more likely to reveal the erroneous behavior of a DL system earlier during test execution. Input prioritization approaches have been rudimentary analyzed against each other, this study compares different input prioritization techniques regarding their effectiveness and efficiency. This work considers surprise adequacy, autoencoder-based, and similarity-based input prioritization approaches in the example of testing a DL image classification algorithms applied on MNIST, Fashion-MNIST, CIFAR-10, and STL-10 datasets. To measure effectiveness and efficiency, we use a modified APFD (Average Percentage of Fault Detected), and set up & execution time, respectively. We observe that the surprise adequacy is the most effective (0.785 to 0.914 APFD). The autoencoder-based and similarity-based techniques are less effective, with the performance from 0.532 to 0.744 APFD and 0.579 to 0.709 APFD, respectively. In contrast, the similarity-based and surprise adequacy-based approaches are the most and least efficient, respectively. The findings in this work demonstrate the trade-off between the considered input prioritization techniques to understanding their practical applicability for testing DL algorithms.

test prioritization

diversity-based testing

deep learning

Författare

Vasilii Mosin

Volvo Group

Miroslaw Staron

Göteborgs universitet

Chalmers, Data- och informationsteknik, Software Engineering

Forskning Andra publikationer

Darko Durisic

Volvo Group

Forskning Andra publikationer

Francisco Gomes

Software Engineering 1

Göteborgs universitet

Forskning Andra publikationer

Sushant Kumar Pandey

Göteborgs universitet

Software Engineering 1

Forskning Andra publikationer

Ashok Chaitanya Koppisetty

Volvo Group

Proceedings - 48th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2022

76-83
9781665461528 (ISBN)

48th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2022
Gran Canaria, Spain,

Ämneskategorier (SSIF 2025)

Programvaruteknik

Datavetenskap (datalogi)

DOI

10.1109/SEAA56994.2022.00020

Publikationsdata kopplat till DOI

Mer information

Senast uppdaterat

2025-11-18

Comparing Input Prioritization Techniques for Testing Deep Learning Algorithms Paper i proceeding, 2022

Författare

Vasilii Mosin

Miroslaw Staron

Darko Durisic

Francisco Gomes

Sushant Kumar Pandey

Ashok Chaitanya Koppisetty

Proceedings - 48th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2022

Ämneskategorier (SSIF 2025)

DOI

Mer information

Senast uppdaterat

Comparing Input Prioritization Techniques for Testing Deep Learning Algorithms
Paper i proceeding, 2022