Enhancing basal cell carcinoma classification in preoperative biopsies via transfer learning with weakly supervised graph transformers
Artikel i vetenskaplig tidskrift, 2025

BackgroundBasal cell carcinoma (BCC) is the most common skin cancer, placing a significant burden on healthcare systems globally. Developing high-precision automated diagnostics requires large annotated datasets, which are costly and difficult to obtain. This study aimed to fine-tune a weakly supervised machine learning model to classify BCC in preoperative punch biopsies using transfer learning. By addressing challenges of scalability and variability, this approach seeks to enhance generalizability and diagnostic accuracy.MethodsThe Basal Cell Classification (BCCC) dataset included 514 WSIs of punch biopsies (261 with BCC and 253 tumor-free slides), divided into training (70%), validation (15%), and test sets (15%). WSIs were split into patches, and features were extracted using a pretrained simCLR model trained on 1,435 WSIs from BCC excisions. Features were formed into graphs for spatial information and the processed by a Vision Transformer. Testing included finetuned and non-finetuned pre-trained models as well as a model trained from the scratch, evaluated on 78 WSIs from the BCCC dataset. The COBRA dataset of 3,588 WSIs (1,794 with BCC and 1,794 without) was used for external validation. Models classified no-tumor vs. tumor (two classes), no-tumor vs. low-risk vs. high-risk tumors (three classes), and no-tumor vs. four BCC subtypes (five classes).ResultsThe fine-tuned model significantly outperformed the non-fine-tuned pretrained model and the model trained from the scratch with accuracies of 91.7%, 82.1%, and 75.3% and with AUCs of 0.98, 0.95-0.98, and 0.91-0.97 for two, three, and five-class classification. On the external validation, accuracies were 84.9% and 70.5%, with AUCs of 0.92 and 0.89-0.91 for two and three-class classification, respectively. The ablation study revealed that the fine-tuned model outperformed the model trained from scratch, improving mean accuracy by 10.6%, 11.7%, and 13.1% on the BCCC dataset, as well as by 29.6% and 19.2% on the COBRA dataset.ConclusionsThe results suggest that transfer learning not only enhances model performance on small datasets but also supports robust feature extraction in complex histopathology tasks. These findings reinforce the utility of pre-trained models in computational pathology, where access to large, labeled datasets is often limited, and task-specific challenges require nuanced understanding of the visual data.

Basal cell carcinoma

Transfer learning

Weakly supervised

Deep learning

Graph transformer

Graph convolutional network

Digital pathology

Författare

Johan Bjorkman

Göteborgs universitet

Student vid Chalmers

Sigrid Lagerroth

Göteborgs universitet

Jan Siarov

Göteborgs universitet

Filmon Yacob

Ekkono Solut

Noora Neittaanmaki

Göteborgs universitet

BMC Medical Imaging

14712342 (eISSN)

Vol. 25 1 166

Ämneskategorier (SSIF 2025)

Medicinsk bildvetenskap

Cancer och onkologi

DOI

10.1186/s12880-025-01710-4

PubMed

40380086

Relaterade dataset

Basal cell carcinoma classification [dataset]

DOI: https://doi.org/10.23698/aida/bccc

Mer information

Senast uppdaterat

2025-05-28