Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning

Ekholm, Andreas; Wang, Yinxi; Vallon-Christersson, Johan; Boissin, Constance; Rantalainen, Mattias

Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning

Mark

Ekholm, Andreas ; Wang, Yinxi ; Vallon-Christersson, Johan ^LU

; Boissin, Constance and Rantalainen, Mattias (2024) In BMC Cancer 24(1).

Abstract: Background: In breast cancer, several gene expression assays have been developed to provide a more personalised treatment. This study focuses on the prediction of two molecular proliferation signatures: an 11-gene proliferation score and the MKI67 proliferation marker gene. The aim was to assess whether these could be predicted from digital whole slide images (WSIs) using deep learning models. Methods: WSIs and RNA-sequencing data from 819 invasive breast cancer patients were included for training, and models were evaluated on an internal test set of 172 cases as well as on 997 cases from a fully independent external test set. Two deep Convolutional Neural Network (CNN) models were optimised using WSIs and gene expression readouts from... (More); Background: In breast cancer, several gene expression assays have been developed to provide a more personalised treatment. This study focuses on the prediction of two molecular proliferation signatures: an 11-gene proliferation score and the MKI67 proliferation marker gene. The aim was to assess whether these could be predicted from digital whole slide images (WSIs) using deep learning models. Methods: WSIs and RNA-sequencing data from 819 invasive breast cancer patients were included for training, and models were evaluated on an internal test set of 172 cases as well as on 997 cases from a fully independent external test set. Two deep Convolutional Neural Network (CNN) models were optimised using WSIs and gene expression readouts from RNA-sequencing data of either the proliferation signature or the proliferation marker, and assessed using Spearman correlation (r). Prognostic performance was assessed through Cox proportional hazard modelling, estimating hazard ratios (HR). Results: Optimised CNNs successfully predicted the proliferation score and proliferation marker on the unseen internal test set (ρ = 0.691(p < 0.001) with R² = 0.438, and ρ = 0.564 (p < 0.001) with R² = 0.251 respectively) and on the external test set (ρ = 0.502 (p < 0.001) with R² = 0.319, and ρ = 0.403 (p < 0.001) with R² = 0.222 respectively). Patients with a high proliferation score or marker were significantly associated with a higher risk of recurrence or death in the external test set (HR = 1.65 (95% CI: 1.05–2.61) and HR = 1.84 (95% CI: 1.17–2.89), respectively). Conclusions: The results from this study suggest that gene expression levels of proliferation scores can be predicted directly from breast cancer morphology in WSIs using CNNs and that the predictions provide prognostic information that could be used in research as well as in the clinical setting.
(Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/b5a8bfa6-f9ba-4d31-a202-5a4dcc43758a

author

Ekholm, Andreas ; Wang, Yinxi ; Vallon-Christersson, Johan ^LU

; Boissin, Constance and Rantalainen, Mattias

organization

publishing date

2024-12

type

Contribution to journal

publication status

published

subject

Cancer and Oncology

keywords

Artificial intelligence, Breast cancer, Computational pathology, Gene expression, Proliferation

in

BMC Cancer

volume

24

issue

1

article number

1510

publisher

BioMed Central (BMC)

external identifiers

scopus:85211817292
pmid:39663527

ISSN

1471-2407

DOI

10.1186/s12885-024-13248-9

language

English

LU publication?

yes

id

b5a8bfa6-f9ba-4d31-a202-5a4dcc43758a

date added to LUP

2025-01-22 10:58:15

date last changed

2026-02-05 19:45:52

@article{b5a8bfa6-f9ba-4d31-a202-5a4dcc43758a,
  abstract     = {{<p>Background: In breast cancer, several gene expression assays have been developed to provide a more personalised treatment. This study focuses on the prediction of two molecular proliferation signatures: an 11-gene proliferation score and the MKI67 proliferation marker gene. The aim was to assess whether these could be predicted from digital whole slide images (WSIs) using deep learning models. Methods: WSIs and RNA-sequencing data from 819 invasive breast cancer patients were included for training, and models were evaluated on an internal test set of 172 cases as well as on 997 cases from a fully independent external test set. Two deep Convolutional Neural Network (CNN) models were optimised using WSIs and gene expression readouts from RNA-sequencing data of either the proliferation signature or the proliferation marker, and assessed using Spearman correlation (r). Prognostic performance was assessed through Cox proportional hazard modelling, estimating hazard ratios (HR). Results: Optimised CNNs successfully predicted the proliferation score and proliferation marker on the unseen internal test set (ρ = 0.691(p &lt; 0.001) with R<sup>2</sup> = 0.438, and ρ = 0.564 (p &lt; 0.001) with R<sup>2</sup> = 0.251 respectively) and on the external test set (ρ = 0.502 (p &lt; 0.001) with R<sup>2</sup> = 0.319, and ρ = 0.403 (p &lt; 0.001) with R<sup>2</sup> = 0.222 respectively). Patients with a high proliferation score or marker were significantly associated with a higher risk of recurrence or death in the external test set (HR = 1.65 (95% CI: 1.05–2.61) and HR = 1.84 (95% CI: 1.17–2.89), respectively). Conclusions: The results from this study suggest that gene expression levels of proliferation scores can be predicted directly from breast cancer morphology in WSIs using CNNs and that the predictions provide prognostic information that could be used in research as well as in the clinical setting.</p>}},
  author       = {{Ekholm, Andreas and Wang, Yinxi and Vallon-Christersson, Johan and Boissin, Constance and Rantalainen, Mattias}},
  issn         = {{1471-2407}},
  keywords     = {{Artificial intelligence; Breast cancer; Computational pathology; Gene expression; Proliferation}},
  language     = {{eng}},
  number       = {{1}},
  publisher    = {{BioMed Central (BMC)}},
  series       = {{BMC Cancer}},
  title        = {{Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning}},
  url          = {{http://dx.doi.org/10.1186/s12885-024-13248-9}},
  doi          = {{10.1186/s12885-024-13248-9}},
  volume       = {{24}},
  year         = {{2024}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning