Multimodal Gene Expression Deep Learning for Predicting Sentinel Lymph Node Macro-metastasis in Early Breast Cancer: Development and Validation in the SCAN-B Cohort

Zhang, Daqu; Staaf, Johan; Bendahl, Pär-Ola; Dihge, Looket; Ohlsson, Mattias; Sjöström, Martin; Vallon-Christersson, Johan; Edén, Patrik; Ryden, Lisa

Multimodal Gene Expression Deep Learning for Predicting Sentinel Lymph Node Macro-metastasis in Early Breast Cancer: Development and Validation in the SCAN-B Cohort

Mark

Zhang, Daqu ^LU ; Staaf, Johan ^LU

; Bendahl, Pär-Ola ^LU ; Dihge, Looket ^LU ; Ohlsson, Mattias ^LU

; Sjöström, Martin ^LU ; Vallon-Christersson, Johan ^LU

; Edén, Patrik ^LU and Ryden, Lisa ^LU

(2026)

Abstract: Background: This study evaluates deep learning (DL) using gene expression (GEX) and preoperatively available clinical data (PreopClinic) to predict sentinel lymph node macro-metastasis (SLNM), and explores their potential for guiding axillary surgery de-escalation and supporting prognostic assessment.
Methods: We retrospectively included 6,836 clinically node-negative (cN0) T1-T2 patients with invasive breast cancer who underwent primary surgery from the Swedish SCAN-B cohort. Three DL models—a multilayer perceptron, a pathway-informed sparse neural network, and a transformer—were developed using the development set (n=4,625) and evaluated against XGBoost in the independent test set (n=2,211).
Results: The Transformer outperformed... (More); Background: This study evaluates deep learning (DL) using gene expression (GEX) and preoperatively available clinical data (PreopClinic) to predict sentinel lymph node macro-metastasis (SLNM), and explores their potential for guiding axillary surgery de-escalation and supporting prognostic assessment.
Methods: We retrospectively included 6,836 clinically node-negative (cN0) T1-T2 patients with invasive breast cancer who underwent primary surgery from the Swedish SCAN-B cohort. Three DL models—a multilayer perceptron, a pathway-informed sparse neural network, and a transformer—were developed using the development set (n=4,625) and evaluated against XGBoost in the independent test set (n=2,211).
Results: The Transformer outperformed other methods for GEX modeling and minimized the need for prior gene selection. In the independent test set, the combined PreopClinic+GEX model significantly improved SLNM prediction (ROC AUC 0.693, P¡0.001) and better identified low-risk patients who might avoid unnecessary SLNB (reduction rate 27.2% at a sensitivity of 92.1%, P=0.02) compared to the PreopClinic model alone. Notably, across-subtype training outperformed within-subtype training, improving nodal prediction, especially in TNBC (ROC AUC 0.734; 95% CI: 0.644-0.837), achieving a substantial SLNB reduction rate of 51.5% (95% CI: 43.2-59.9%). Importantly, the derived SLNM predictor showed prognostic significance (P=0.039), and provided complementary information to the established prognostic factors in the ER+HER2- patients recommended for SLNB under the 2025 ASCO guidelines.
Conclusion: These findings highlight the Transformer’s robustness against noise and effectiveness in capturing informative GEX features across scales, suggesting the potential of integrating GEX data and PreopClinic variables to enable further axillary surgical de-escalation, including for patients with tumor characteristics not reflected in current ASCO recommendations. (Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/b5ed5b46-a4dc-4130-b205-41aa18f369dc

author

Zhang, Daqu ^LU ; Staaf, Johan ^LU

; Bendahl, Pär-Ola ^LU ; Dihge, Looket ^LU ; Ohlsson, Mattias ^LU

; Sjöström, Martin ^LU ; Vallon-Christersson, Johan ^LU

; Edén, Patrik ^LU and Ryden, Lisa ^LU

organization

publishing date

2026

type

Working paper/Preprint

publication status

published

subject

keywords

breast cancer, Axillary lymph node metastasis, sentinel lymph node biopsy, preoperative lymph node staging, gene expression, cancer pathway, deep learning, transformer

publisher

Research Square

DOI

10.21203/rs.3.rs-9281660/v1

language

English

LU publication?

yes

id

b5ed5b46-a4dc-4130-b205-41aa18f369dc

date added to LUP

2026-05-05 13:16:14

date last changed

2026-05-06 09:06:01

@misc{b5ed5b46-a4dc-4130-b205-41aa18f369dc,
  abstract     = {{Background: This study evaluates deep learning (DL) using gene expression (GEX) and preoperatively available clinical data (PreopClinic) to predict sentinel lymph node macro-metastasis (SLNM), and explores their potential for guiding axillary surgery de-escalation and supporting prognostic assessment.<br/>Methods: We retrospectively included 6,836 clinically node-negative (cN0) T1-T2 patients with invasive breast cancer who underwent primary surgery from the Swedish SCAN-B cohort. Three DL models—a multilayer perceptron, a pathway-informed sparse neural network, and a transformer—were developed using the development set (n=4,625) and evaluated against XGBoost in the independent test set (n=2,211).<br/>Results: The Transformer outperformed other methods for GEX modeling and minimized the need for prior gene selection. In the independent test set, the combined PreopClinic+GEX model significantly improved SLNM prediction (ROC AUC 0.693, P¡0.001) and better identified low-risk patients who might avoid unnecessary SLNB (reduction rate 27.2% at a sensitivity of 92.1%, P=0.02) compared to the PreopClinic model alone. Notably, across-subtype training outperformed within-subtype training, improving nodal prediction, especially in TNBC (ROC AUC 0.734; 95% CI: 0.644-0.837), achieving a substantial SLNB reduction rate of 51.5% (95% CI: 43.2-59.9%). Importantly, the derived SLNM predictor showed prognostic significance (P=0.039), and provided complementary information to the established prognostic factors in the ER+HER2- patients recommended for SLNB under the 2025 ASCO guidelines. <br/>Conclusion: These findings highlight the Transformer’s robustness against noise and effectiveness in capturing informative GEX features across scales, suggesting the potential of integrating GEX data and PreopClinic variables to enable further axillary surgical de-escalation, including for patients with tumor characteristics not reflected in current ASCO recommendations.}},
  author       = {{Zhang, Daqu and Staaf, Johan and Bendahl, Pär-Ola and Dihge, Looket and Ohlsson, Mattias and Sjöström, Martin and Vallon-Christersson, Johan and Edén, Patrik and Ryden, Lisa}},
  keywords     = {{breast cancer; Axillary lymph node metastasis; sentinel lymph node biopsy; preoperative lymph node staging; gene expression; cancer pathway; deep learning; transformer}},
  language     = {{eng}},
  note         = {{Preprint}},
  publisher    = {{Research Square}},
  title        = {{Multimodal Gene Expression Deep Learning for Predicting Sentinel Lymph Node Macro-metastasis in Early Breast Cancer: Development and Validation in the SCAN-B Cohort}},
  url          = {{http://dx.doi.org/10.21203/rs.3.rs-9281660/v1}},
  doi          = {{10.21203/rs.3.rs-9281660/v1}},
  year         = {{2026}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Multimodal Gene Expression Deep Learning for Predicting Sentinel Lymph Node Macro-metastasis in Early Breast Cancer: Development and Validation in the SCAN-B Cohort