Enhancing glass-box methods for part-of-speech tagging and text classification
Licentiate thesis, 2025
The focus of this thesis is therefore on enhancing inherently transparent (glass-box) methods, thus establishing a foundation for their use in high-stakes scenarios. First, the task of POS tagging is considered. While often considered a solved problem, the findings here show that several challenges in POS tagging still remain. Based on these findings, a rule-based approach is explored for correcting the output of POS taggers. Second, interpretable text classification is studied with an enhanced linear classification method. The results demonstrate that a fully interpretable classifier can achieve a high performance when using the proposed enhancements, approaching that of pretrained DNN-based methods.
glass-box methods
part-of-speech tagging
interpretability
natural language processing
text classification
Author
Minerva Suvanto
Chalmers, Mechanics and Maritime Sciences (M2), Vehicle Engineering and Autonomous Systems
A Challenging Data Set for Evaluating Part-of-speech Taggers
International Conference on Agents and Artificial Intelligence,;Vol. 2(2024)p. 79-86
Paper in proceeding
Suvanto M., Wahde M., and Della Vedova M. L., Part-of-speech Taggers Make Errors on Unambiguous Sentences
An interpretable method for automated classification of spoken transcripts and written text
Evolutionary Intelligence,;Vol. 17(2024)p. 609-621
Journal article
Suvanto M. and Wahde M., Improving glass-box sentiment classification via feature set extension
Subject Categories (SSIF 2025)
Natural Language Processing
Publisher
Chalmers
HC1, Hörsalsvägen 14, Chalmers
Opponent: Dr. Peter Barclay, Edinburgh Napier University, Scotland