Reliable leukemia detection via transfer-enhanced Bayesian CNNs

Hita, Xhesina; Javed, Farrukh; Lodi, Stefano

Reliable leukemia detection via transfer-enhanced Bayesian CNNs

Mark

Hita, Xhesina ; Javed, Farrukh ^LU and Lodi, Stefano (2026) In Computers in Biology and Medicine 202.

Abstract: Accurate and early detection of Acute Lymphoblastic Leukemia (ALL) is critical for timely intervention and improved patient outcomes. However, the development of reliable deep learning models for hematological image analysis is challenged by limited data availability, dataset bias, and the need for trustworthy predictions in clinical settings. In this study, we propose a Bayesian deep learning framework that integrates transfer learning, data augmentation, and uncertainty quantification for robust classification of leukemic and healthy lymphocytes from peripheral blood smear images. Three widely used convolutional neural network architectures, InceptionV3, VGG16, and ResNet50, pretrained on ImageNet are fine-tuned on the ALL-IDB2... (More); Accurate and early detection of Acute Lymphoblastic Leukemia (ALL) is critical for timely intervention and improved patient outcomes. However, the development of reliable deep learning models for hematological image analysis is challenged by limited data availability, dataset bias, and the need for trustworthy predictions in clinical settings. In this study, we propose a Bayesian deep learning framework that integrates transfer learning, data augmentation, and uncertainty quantification for robust classification of leukemic and healthy lymphocytes from peripheral blood smear images. Three widely used convolutional neural network architectures, InceptionV3, VGG16, and ResNet50, pretrained on ImageNet are fine-tuned on the ALL-IDB2 dataset and extended with Monte Carlo dropout to enable Bayesian inference. Model performance is evaluated using 10-fold cross-validation on both original and augmented datasets, with accuracy, sensitivity, specificity, Youden’s index, and Brier score used as evaluation metrics. Among the evaluated models, VGG16 demonstrates the most consistent improvements under data augmentation, achieving the highest accuracy (98.65%±0.09), Youden’s index (0.97±0.001) and Brier score (0.035±0.010), while ResNet50 shows strong but more moderate gains. In contrast, InceptionV3 exhibits limited sensitivity to augmentation and comparatively lower robustness. Beyond average predictive performance, Bayesian uncertainty analysis reveals that misclassifications and borderline predictions are consistently associated with elevated predictive entropy and mutual information. Saliency map inspection further indicates that high-uncertainty cases correspond to diffuse or non-localized attention patterns, suggesting reliance on spurious contextual features rather than stable morphological cues. These findings highlight the importance of uncertainty-aware predictions for identifying cases that may require expert pathological review. Overall, the proposed framework combines strong diagnostic performance with interpretable uncertainty estimates, supporting its role as a transparent and clinically trustworthy tool for AI-assisted leukemia screening.
(Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/7258175f-3a0c-4863-8ff1-fede7c40bbc3

author

Hita, Xhesina ; Javed, Farrukh ^LU and Lodi, Stefano

organization

publishing date

2026-02-01

type

Contribution to journal

publication status

published

subject

Medical Imaging

keywords

Acute lymphoblastic leukemia, Bayesian CNN, Deep learning, Medical image classification, Transfer learning, Uncertainty quantification

in

Computers in Biology and Medicine

volume

202

article number

111419

publisher

Elsevier

external identifiers

scopus:105026445354
pmid:41485393

ISSN

0010-4825

DOI

10.1016/j.compbiomed.2025.111419

language

English

LU publication?

yes

id

7258175f-3a0c-4863-8ff1-fede7c40bbc3

date added to LUP

2026-03-09 13:43:15

date last changed

2026-05-20 02:26:21

@article{7258175f-3a0c-4863-8ff1-fede7c40bbc3,
  abstract     = {{<p>Accurate and early detection of Acute Lymphoblastic Leukemia (ALL) is critical for timely intervention and improved patient outcomes. However, the development of reliable deep learning models for hematological image analysis is challenged by limited data availability, dataset bias, and the need for trustworthy predictions in clinical settings. In this study, we propose a Bayesian deep learning framework that integrates transfer learning, data augmentation, and uncertainty quantification for robust classification of leukemic and healthy lymphocytes from peripheral blood smear images. Three widely used convolutional neural network architectures, InceptionV3, VGG16, and ResNet50, pretrained on ImageNet are fine-tuned on the ALL-IDB2 dataset and extended with Monte Carlo dropout to enable Bayesian inference. Model performance is evaluated using 10-fold cross-validation on both original and augmented datasets, with accuracy, sensitivity, specificity, Youden’s index, and Brier score used as evaluation metrics. Among the evaluated models, VGG16 demonstrates the most consistent improvements under data augmentation, achieving the highest accuracy (98.65%±0.09), Youden’s index (0.97±0.001) and Brier score (0.035±0.010), while ResNet50 shows strong but more moderate gains. In contrast, InceptionV3 exhibits limited sensitivity to augmentation and comparatively lower robustness. Beyond average predictive performance, Bayesian uncertainty analysis reveals that misclassifications and borderline predictions are consistently associated with elevated predictive entropy and mutual information. Saliency map inspection further indicates that high-uncertainty cases correspond to diffuse or non-localized attention patterns, suggesting reliance on spurious contextual features rather than stable morphological cues. These findings highlight the importance of uncertainty-aware predictions for identifying cases that may require expert pathological review. Overall, the proposed framework combines strong diagnostic performance with interpretable uncertainty estimates, supporting its role as a transparent and clinically trustworthy tool for AI-assisted leukemia screening.</p>}},
  author       = {{Hita, Xhesina and Javed, Farrukh and Lodi, Stefano}},
  issn         = {{0010-4825}},
  keywords     = {{Acute lymphoblastic leukemia; Bayesian CNN; Deep learning; Medical image classification; Transfer learning; Uncertainty quantification}},
  language     = {{eng}},
  month        = {{02}},
  publisher    = {{Elsevier}},
  series       = {{Computers in Biology and Medicine}},
  title        = {{Reliable leukemia detection via transfer-enhanced Bayesian CNNs}},
  url          = {{http://dx.doi.org/10.1016/j.compbiomed.2025.111419}},
  doi          = {{10.1016/j.compbiomed.2025.111419}},
  volume       = {{202}},
  year         = {{2026}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Reliable leukemia detection via transfer-enhanced Bayesian CNNs