Comparing Anomaly Detection and Classification Algorithms: A Case Study in Two Domains
Paper in proceedings, 2023

Using large data sets in practical scenarios usually requires identifying, annotating and classifying rare events or anomalies. Although several methods exist, they fall into two main classes: anomaly detection algorithms and classification algorithms. The two classes have different characteristics, and in this paper we compare them on two cases. We use data from a neurointensive care unit and from microwave radio transmissions. We apply the Isolation Forest and Random Forest algorithms to find events in the data that occur with a frequency of approximately 1%. The results show that the classification algorithm (Random Forest) performs better and can achieve up to 100% accuracy, while the anomaly detection algorithm (Isolation Forest) achieves only 73% at best. Based on the results, we conclude that it is better to invest in annotating data a priori and to use classification algorithms, despite the lower cost of using anomaly detection algorithms.
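The comparison described in the abstract can be illustrated with a minimal sketch, assuming scikit-learn and purely synthetic data. The function name `compare_detectors`, the four Gaussian features, the 70/30 split and all parameter values are assumptions made for illustration; they are not the paper's data, pipeline or settings, and the accuracies the sketch produces are not the paper's results.

```python
import numpy as np
from sklearn.ensemble import IsolationForest, RandomForestClassifier
from sklearn.model_selection import train_test_split


def compare_detectors(n_samples=5000, rare_fraction=0.01, seed=0):
    """Compare Isolation Forest and Random Forest on synthetic rare events.

    Hypothetical helper: the data is a synthetic stand-in, not the
    neurointensive care or microwave radio data used in the paper.
    """
    rng = np.random.default_rng(seed)
    n_rare = int(n_samples * rare_fraction)

    # Assumption: normal points cluster near the origin, rare events are shifted.
    normal = rng.normal(loc=0.0, scale=1.0, size=(n_samples - n_rare, 4))
    rare = rng.normal(loc=4.0, scale=1.0, size=(n_rare, 4))
    X = np.vstack([normal, rare])
    y = np.concatenate([
        np.zeros(n_samples - n_rare, dtype=int),
        np.ones(n_rare, dtype=int),
    ])

    # Stratify so the ~1% event rate is preserved in both splits.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, stratify=y, random_state=seed
    )

    # Anomaly detection: fitted without labels; contamination is set to the
    # expected event rate.
    iso = IsolationForest(contamination=rare_fraction, random_state=seed)
    iso.fit(X_train)
    iso_pred = (iso.predict(X_test) == -1).astype(int)  # -1 marks an outlier

    # Classification: requires the annotated labels y_train.
    rf = RandomForestClassifier(n_estimators=100, random_state=seed)
    rf.fit(X_train, y_train)
    rf_pred = rf.predict(X_test)

    return {
        "isolation_forest": float(np.mean(iso_pred == y_test)),
        "random_forest": float(np.mean(rf_pred == y_test)),
    }


if __name__ == "__main__":
    for name, accuracy in compare_detectors().items():
        print(f"{name}: accuracy = {accuracy:.3f}")
```

The key difference the sketch highlights is the one the abstract weighs: `IsolationForest.fit` never sees `y_train`, so it needs no annotation effort, whereas `RandomForestClassifier.fit` depends on labels prepared in advance.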

neuro-intensive care

Machine learning

telecommunication

Author

Miroslaw Staron

University of Gothenburg

Helena Odenstedt Herges

Sahlgrenska University Hospital

Ericsson

Linda Block

Ericsson

Martin Sjödin

Ericsson

Lecture Notes in Business Information Processing

1865-1348 (ISSN) 1865-1356 (eISSN)

Vol. 472 LNBIP 121-136
9783031314872 (ISBN)

15th International Conference on Software Quality, SWQD 2023
Munich, Germany,

Subject Categories

Computer Science

DOI

10.1007/978-3-031-31488-9_7

More information

Latest update

7/19/2023