Multimodal Gene Expression Deep Learning for Predicting Sentinel Lymph Node Macro-metastasis in Early Breast Cancer: Development and Validation in the SCAN-B Cohort
(2026)- Abstract
- Background: This study evaluates deep learning (DL) using gene expression (GEX) and preoperatively available clinical data (PreopClinic) to predict sentinel lymph node macro-metastasis (SLNM), and explores their potential for guiding axillary surgery de-escalation and supporting prognostic assessment.
Methods: We retrospectively included 6,836 clinically node-negative (cN0) T1-T2 patients with invasive breast cancer who underwent primary surgery from the Swedish SCAN-B cohort. Three DL models—a multilayer perceptron, a pathway-informed sparse neural network, and a transformer—were developed using the development set (n=4,625) and evaluated against XGBoost in the independent test set (n=2,211).
Results: The Transformer outperformed... (More) - Background: This study evaluates deep learning (DL) using gene expression (GEX) and preoperatively available clinical data (PreopClinic) to predict sentinel lymph node macro-metastasis (SLNM), and explores their potential for guiding axillary surgery de-escalation and supporting prognostic assessment.
Methods: We retrospectively included 6,836 clinically node-negative (cN0) T1-T2 patients with invasive breast cancer who underwent primary surgery from the Swedish SCAN-B cohort. Three DL models—a multilayer perceptron, a pathway-informed sparse neural network, and a transformer—were developed using the development set (n=4,625) and evaluated against XGBoost in the independent test set (n=2,211).
Results: The Transformer outperformed other methods for GEX modeling and minimized the need for prior gene selection. In the independent test set, the combined PreopClinic+GEX model significantly improved SLNM prediction (ROC AUC 0.693, P¡0.001) and better identified low-risk patients who might avoid unnecessary SLNB (reduction rate 27.2% at a sensitivity of 92.1%, P=0.02) compared to the PreopClinic model alone. Notably, across-subtype training outperformed within-subtype training, improving nodal prediction, especially in TNBC (ROC AUC 0.734; 95% CI: 0.644-0.837), achieving a substantial SLNB reduction rate of 51.5% (95% CI: 43.2-59.9%). Importantly, the derived SLNM predictor showed prognostic significance (P=0.039), and provided complementary information to the established prognostic factors in the ER+HER2- patients recommended for SLNB under the 2025 ASCO guidelines.
Conclusion: These findings highlight the Transformer’s robustness against noise and effectiveness in capturing informative GEX features across scales, suggesting the potential of integrating GEX data and PreopClinic variables to enable further axillary surgical de-escalation, including for patients with tumor characteristics not reflected in current ASCO recommendations. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/b5ed5b46-a4dc-4130-b205-41aa18f369dc
- author
- Zhang, Daqu
LU
; Staaf, Johan
LU
; Bendahl, Pär-Ola
LU
; Dihge, Looket
LU
; Ohlsson, Mattias
LU
; Sjöström, Martin
LU
; Vallon-Christersson, Johan
LU
; Edén, Patrik
LU
and Ryden, Lisa
LU
- organization
-
- Centre for Environmental and Climate Science (CEC)
- Computational Science for Health and Environment (research group)
- Breast cancer treatment
- The Liquid Biopsy and Tumor Progression in Breast Cancer (research group)
- Department of Earth and Environmental Sciences (MGeo)
- Breast/lung cancer (research group)
- Division of Translational Cancer Research
- Breast/lungcancer
- LUCC: Lund University Cancer Centre
- Research Group Lung Cancer (research group)
- Surgery (research group)
- Breast Cancer Surgery (research group)
- eSSENCE: The e-Science Collaboration
- SCAN-B (research group)
- Breastcancer-genetics
- Surgery (Lund)
- publishing date
- 2026
- type
- Working paper/Preprint
- publication status
- published
- subject
- keywords
- breast cancer, Axillary lymph node metastasis, sentinel lymph node biopsy, preoperative lymph node staging, gene expression, cancer pathway, deep learning, transformer
- publisher
- Research Square
- DOI
- 10.21203/rs.3.rs-9281660/v1
- language
- English
- LU publication?
- yes
- id
- b5ed5b46-a4dc-4130-b205-41aa18f369dc
- date added to LUP
- 2026-05-05 13:16:14
- date last changed
- 2026-05-06 09:06:01
@misc{b5ed5b46-a4dc-4130-b205-41aa18f369dc,
abstract = {{Background: This study evaluates deep learning (DL) using gene expression (GEX) and preoperatively available clinical data (PreopClinic) to predict sentinel lymph node macro-metastasis (SLNM), and explores their potential for guiding axillary surgery de-escalation and supporting prognostic assessment.<br/>Methods: We retrospectively included 6,836 clinically node-negative (cN0) T1-T2 patients with invasive breast cancer who underwent primary surgery from the Swedish SCAN-B cohort. Three DL models—a multilayer perceptron, a pathway-informed sparse neural network, and a transformer—were developed using the development set (n=4,625) and evaluated against XGBoost in the independent test set (n=2,211).<br/>Results: The Transformer outperformed other methods for GEX modeling and minimized the need for prior gene selection. In the independent test set, the combined PreopClinic+GEX model significantly improved SLNM prediction (ROC AUC 0.693, P¡0.001) and better identified low-risk patients who might avoid unnecessary SLNB (reduction rate 27.2% at a sensitivity of 92.1%, P=0.02) compared to the PreopClinic model alone. Notably, across-subtype training outperformed within-subtype training, improving nodal prediction, especially in TNBC (ROC AUC 0.734; 95% CI: 0.644-0.837), achieving a substantial SLNB reduction rate of 51.5% (95% CI: 43.2-59.9%). Importantly, the derived SLNM predictor showed prognostic significance (P=0.039), and provided complementary information to the established prognostic factors in the ER+HER2- patients recommended for SLNB under the 2025 ASCO guidelines. <br/>Conclusion: These findings highlight the Transformer’s robustness against noise and effectiveness in capturing informative GEX features across scales, suggesting the potential of integrating GEX data and PreopClinic variables to enable further axillary surgical de-escalation, including for patients with tumor characteristics not reflected in current ASCO recommendations.}},
author = {{Zhang, Daqu and Staaf, Johan and Bendahl, Pär-Ola and Dihge, Looket and Ohlsson, Mattias and Sjöström, Martin and Vallon-Christersson, Johan and Edén, Patrik and Ryden, Lisa}},
keywords = {{breast cancer; Axillary lymph node metastasis; sentinel lymph node biopsy; preoperative lymph node staging; gene expression; cancer pathway; deep learning; transformer}},
language = {{eng}},
note = {{Preprint}},
publisher = {{Research Square}},
title = {{Multimodal Gene Expression Deep Learning for Predicting Sentinel Lymph Node Macro-metastasis in Early Breast Cancer: Development and Validation in the SCAN-B Cohort}},
url = {{http://dx.doi.org/10.21203/rs.3.rs-9281660/v1}},
doi = {{10.21203/rs.3.rs-9281660/v1}},
year = {{2026}},
}