Can we reduce the workload of mammographic screening by automatic identification of normal exams with artificial intelligence? A feasibility study

Rodriguez-Ruiz, Alejandro; Lång, Kristina; Gubern-Merida, Albert; Teuwen, Jonas; Broeders, Mireille; Gennaro, Gisella; Clauser, Paola; Helbich, Thomas H.; Chevalier, Margarita; Mertelmeier, Thomas; Wallis, Matthew G.; Andersson, Ingvar; Zackrisson, Sophia; Sechopoulos, Ioannis; Mann, Ritse M.

Can we reduce the workload of mammographic screening by automatic identification of normal exams with artificial intelligence? A feasibility study

Mark

Rodriguez-Ruiz, Alejandro ; Lång, Kristina ^LU ; Gubern-Merida, Albert ; Teuwen, Jonas ; Broeders, Mireille ; Gennaro, Gisella ; Clauser, Paola ; Helbich, Thomas H. ; Chevalier, Margarita and Mertelmeier, Thomas , et al. (2019) In European Radiology 29(9). p.4825-4832

Abstract: Purpose: To study the feasibility of automatically identifying normal digital mammography (DM) exams with artificial intelligence (AI) to reduce the breast cancer screening reading workload. Methods and materials: A total of 2652 DM exams (653 cancer) and interpretations by 101 radiologists were gathered from nine previously performed multi-reader multi-case receiver operating characteristic (MRMC ROC) studies. An AI system was used to obtain a score between 1 and 10 for each exam, representing the likelihood of cancer present. Using all AI scores between 1 and 9 as possible thresholds, the exams were divided into groups of low- and high likelihood of cancer present. It was assumed that, under the pre-selection scenario, only the... (More); Purpose: To study the feasibility of automatically identifying normal digital mammography (DM) exams with artificial intelligence (AI) to reduce the breast cancer screening reading workload. Methods and materials: A total of 2652 DM exams (653 cancer) and interpretations by 101 radiologists were gathered from nine previously performed multi-reader multi-case receiver operating characteristic (MRMC ROC) studies. An AI system was used to obtain a score between 1 and 10 for each exam, representing the likelihood of cancer present. Using all AI scores between 1 and 9 as possible thresholds, the exams were divided into groups of low- and high likelihood of cancer present. It was assumed that, under the pre-selection scenario, only the high-likelihood group would be read by radiologists, while all low-likelihood exams would be reported as normal. The area under the reader-averaged ROC curve (AUC) was calculated for the original evaluations and for the pre-selection scenarios and compared using a non-inferiority hypothesis. Results: Setting the low/high-likelihood threshold at an AI score of 5 (high likelihood > 5) results in a trade-off of approximately halving (− 47%) the workload to be read by radiologists while excluding 7% of true-positive exams. Using an AI score of 2 as threshold yields a workload reduction of 17% while only excluding 1% of true-positive exams. Pre-selection did not change the average AUC of radiologists (inferior 95% CI > − 0.05) for any threshold except at the extreme AI score of 9. Conclusion: It is possible to automatically pre-select exams using AI to significantly reduce the breast cancer screening reading workload. Key Points: • There is potential to use artificial intelligence to automatically reduce the breast cancer screening reading workload by excluding exams with a low likelihood of cancer. • The exclusion of exams with the lowest likelihood of cancer in screening might not change radiologists’ breast cancer detection performance. • When excluding exams with the lowest likelihood of cancer, the decrease in true-positive recalls would be balanced by a simultaneous reduction in false-positive recalls.
(Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/6bb82827-12ed-4749-bf5d-a07aa4e52bec

author

Rodriguez-Ruiz, Alejandro ; Lång, Kristina ^LU ; Gubern-Merida, Albert ; Teuwen, Jonas ; Broeders, Mireille ; Gennaro, Gisella ; Clauser, Paola ; Helbich, Thomas H. ; Chevalier, Margarita and Mertelmeier, Thomas , et al. (More)

Rodriguez-Ruiz, Alejandro ; Lång, Kristina ^LU ; Gubern-Merida, Albert ; Teuwen, Jonas ; Broeders, Mireille ; Gennaro, Gisella ; Clauser, Paola ; Helbich, Thomas H. ; Chevalier, Margarita ; Mertelmeier, Thomas ; Wallis, Matthew G. ; Andersson, Ingvar ^LU ; Zackrisson, Sophia ^LU ; Sechopoulos, Ioannis and Mann, Ritse M. (Less)

organization

publishing date

2019-04-16

type

Contribution to journal

publication status

published

subject

Radiology and Medical Imaging

keywords

Artificial intelligence, Breast cancer, Deep learning, Mammography, Screening

in

European Radiology

volume

29

issue

9

pages

4825 - 4832

publisher

Springer Science and Business Media B.V.

external identifiers

pmid:30993432
scopus:85064708076

ISSN

0938-7994

DOI

10.1007/s00330-019-06186-9

language

English

LU publication?

yes

id

6bb82827-12ed-4749-bf5d-a07aa4e52bec

date added to LUP

2019-05-07 12:48:46

date last changed

2025-12-12 12:54:23

@article{6bb82827-12ed-4749-bf5d-a07aa4e52bec,
  abstract     = {{<p>Purpose: To study the feasibility of automatically identifying normal digital mammography (DM) exams with artificial intelligence (AI) to reduce the breast cancer screening reading workload. Methods and materials: A total of 2652 DM exams (653 cancer) and interpretations by 101 radiologists were gathered from nine previously performed multi-reader multi-case receiver operating characteristic (MRMC ROC) studies. An AI system was used to obtain a score between 1 and 10 for each exam, representing the likelihood of cancer present. Using all AI scores between 1 and 9 as possible thresholds, the exams were divided into groups of low- and high likelihood of cancer present. It was assumed that, under the pre-selection scenario, only the high-likelihood group would be read by radiologists, while all low-likelihood exams would be reported as normal. The area under the reader-averaged ROC curve (AUC) was calculated for the original evaluations and for the pre-selection scenarios and compared using a non-inferiority hypothesis. Results: Setting the low/high-likelihood threshold at an AI score of 5 (high likelihood &gt; 5) results in a trade-off of approximately halving (− 47%) the workload to be read by radiologists while excluding 7% of true-positive exams. Using an AI score of 2 as threshold yields a workload reduction of 17% while only excluding 1% of true-positive exams. Pre-selection did not change the average AUC of radiologists (inferior 95% CI &gt; − 0.05) for any threshold except at the extreme AI score of 9. Conclusion: It is possible to automatically pre-select exams using AI to significantly reduce the breast cancer screening reading workload. Key Points: • There is potential to use artificial intelligence to automatically reduce the breast cancer screening reading workload by excluding exams with a low likelihood of cancer. • The exclusion of exams with the lowest likelihood of cancer in screening might not change radiologists’ breast cancer detection performance. • When excluding exams with the lowest likelihood of cancer, the decrease in true-positive recalls would be balanced by a simultaneous reduction in false-positive recalls.</p>}},
  author       = {{Rodriguez-Ruiz, Alejandro and Lång, Kristina and Gubern-Merida, Albert and Teuwen, Jonas and Broeders, Mireille and Gennaro, Gisella and Clauser, Paola and Helbich, Thomas H. and Chevalier, Margarita and Mertelmeier, Thomas and Wallis, Matthew G. and Andersson, Ingvar and Zackrisson, Sophia and Sechopoulos, Ioannis and Mann, Ritse M.}},
  issn         = {{0938-7994}},
  keywords     = {{Artificial intelligence; Breast cancer; Deep learning; Mammography; Screening}},
  language     = {{eng}},
  month        = {{04}},
  number       = {{9}},
  pages        = {{4825--4832}},
  publisher    = {{Springer Science and Business Media B.V.}},
  series       = {{European Radiology}},
  title        = {{Can we reduce the workload of mammographic screening by automatic identification of normal exams with artificial intelligence? A feasibility study}},
  url          = {{http://dx.doi.org/10.1007/s00330-019-06186-9}},
  doi          = {{10.1007/s00330-019-06186-9}},
  volume       = {{29}},
  year         = {{2019}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Can we reduce the workload of mammographic screening by automatic identification of normal exams with artificial intelligence? A feasibility study