Sustainable Selection of Machine Learning Algorithm for Gender-bias Attenuated Prediction
Artikel i vetenskaplig tidskrift, 2024
Research into novel approaches like Machine Learning (ML) promotes a new set of opportunities for sustainable development of applications through automation. However, there are certain ML tasks which are prone to spurious classification, mainly due to the bias in legacy data. One well-known and highly actual misclassification case concerns gender. As the vast dataset for engineering rules, standards and experiments are based on men, a bias towards women is the subject of research. Accordingly, any bias should be contained before the algorithms are deployed to the service of the sustainable society. There is a substantial amount of data on ML gender-bias in the literature. In these, the majority of the investigated cases are for ML branches like image or sound processing and text recognition. However, utilizing ML for driving style investigations is not an extensively researched area. In this work, a novel application for gender-based classification with bias-attenuation using anonymized driving data will be presented. Using data devoid of biometric and geographic information, the proposed pipeline distinguishes manifested binary genders with 80% accuracy for the drivers in the holdout data set. In addition, a method for sustainable algorithm selection and its extension to embedded applications, is proposed. An investigation into the environmental burden of seven different types of ML algorithms was conducted and the popular neural network algorithm had the highest environmental burden.
Machine learning
Driving style classification
Unsupervised/supervised learning
Energy consumption
Feature engineering
Battery electric vehicles