Comparing Anomaly Detection and Classification Algorithms: A Case Study in Two Domains
Paper i proceeding, 2023

Utilizing large data sets in practical scenarios usually requires identifying, annotating and classifying rare events or anomalies. Although several methods exists, there are two classes of algorithms: anomaly detection algorithms and classification algorithms. Both types of algorithms have different characteristics and in this paper, we set out to compare them on two cases. We use data from a neurointensive care unit and from microwave radio transmissions. We apply Isolation Forest and Random Forest algorithms to find events in the data that occur with a frequency of ca. 1%. The results show that classification algorithms (Random Forest) perform better and can achieve up to 100% accuracy, while the anomaly detection algorithms (Isolation Forest) can achieve only 73% at best. Based on the results, we conclude that it is better to invest in annotating data á priori and use classification algorithms, despite the lower costs of using the anomaly detection algorithms.

neuro-intensive care

Machine learning

telecommunication

Författare

Miroslaw Staron

Göteborgs universitet

Helena Odenstedt Herges

Sahlgrenska universitetssjukhuset

Ericsson AB

Linda Block

Ericsson AB

Martin Sjödin

Ericsson AB

Lecture Notes in Business Information Processing

1865-1348 (ISSN) 18651356 (eISSN)

Vol. 472 LNBIP 121-136
9783031314872 (ISBN)

15th International Conference on Software Quality, SWQD 2023
Munich, Germany,

Ämneskategorier (SSIF 2011)

Datavetenskap (datalogi)

DOI

10.1007/978-3-031-31488-9_7

Mer information

Senast uppdaterat

2023-07-19