Comparing Anomaly Detection and Classification Algorithms: A Case Study in Two Domains
Paper i proceeding, 2023

Utilizing large data sets in practical scenarios usually requires identifying, annotating and classifying rare events or anomalies. Although several methods exists, there are two classes of algorithms: anomaly detection algorithms and classification algorithms. Both types of algorithms have different characteristics and in this paper, we set out to compare them on two cases. We use data from a neurointensive care unit and from microwave radio transmissions. We apply Isolation Forest and Random Forest algorithms to find events in the data that occur with a frequency of ca. 1%. The results show that classification algorithms (Random Forest) perform better and can achieve up to 100% accuracy, while the anomaly detection algorithms (Isolation Forest) can achieve only 73% at best. Based on the results, we conclude that it is better to invest in annotating data á priori and use classification algorithms, despite the lower costs of using the anomaly detection algorithms.

Machine learning

telecommunication

neuro-intensive care

Författare

Miroslaw Staron

Chalmers, Data- och informationsteknik, Software Engineering

Göteborgs universitet

Helena Odenstedt Herges

Ericsson AB

Sahlgrenska universitetssjukhuset

Linda Block

Ericsson AB

Martin Sjödin

Ericsson AB

Lecture Notes in Business Information Processing

1865-1348 (ISSN) 18651356 (eISSN)

Vol. 472 LNBIP 121-136
9783031314872 (ISBN)

15th International Conference on Software Quality, SWQD 2023
Munich, Germany,

Ämneskategorier (SSIF 2011)

Datavetenskap (datalogi)

DOI

10.1007/978-3-031-31488-9_7

Mer information

Senast uppdaterat

2025-05-20