Machine learning algorithm for classification of breast ultrasound images

Karlsson, Jennie; Ramkull, Jennifer; Arvidsson, Ida; Heyden, Anders, et al. (2022-04-04). Machine learning algorithm for classification of breast ultrasound images Medical Imaging 2022 : Computer-Aided Diagnosis, 12033,. SPIE Medical Imaging 2022:. San Diego, United States: SPIE
Download:
DOI:
Conference Proceeding/Paper | Published | English
Authors:
Karlsson, Jennie ; Ramkull, Jennifer ; Arvidsson, Ida ; Heyden, Anders , et al.
Department:
Mathematics (Faculty of Engineering)
eSSENCE: The e-Science Collaboration
ELLIIT: the Linköping-Lund initiative on IT and mobile communication
Centre for Mathematical Sciences
Mathematical Imaging Group
Stroke Imaging Research group
Engineering Mathematics (M.Sc.Eng.)
Partial differential equations
LUCC: Lund University Cancer Centre
Radiology Diagnostics, Malmö
Project:
23°N - Tropic of Cancer, accessible breast diagnostics for global health
Research Group:
Mathematical Imaging Group
Stroke Imaging Research group
Partial differential equations
Radiology Diagnostics, Malmö
Abstract:
Breast cancer is the most common type of cancer globally. Early detection is important for reducing the morbidity and mortality of breast cancer. The aim of this study was to evaluate the performance of different machine learning models to classify malignant or benign lesions on breast ultrasound images. Three different convolutional neural network approaches were implemented: (a) Simple convolutional neural network, (b) transfer learning using pre-trained InceptionV3, ResNet50V2, VGG19 and Xception and (c) deep feature networks based on combinations of the four transfer networks in (b). The data consisted of two breast ultrasound image data sets: (1) an open, single-vendor, data set collected by Cairo University at Baheya Hospital, Egypt, consisting of 437 benign lesions and 210 malignant lesions, where 10% was set to be a test set and the rest was used for training and validation (development) and (2) An in-house, multi-vendor data set collected at Unilabs Mammography Unit, Skåne University Hospital, Sweden, consisting of 13 benign lesions and 265 malignant lesions, was used as an external test set. Both test sets were used for evaluating the networks. The performance measures used were area under the receiver operating characteristic curve (AUC), sensitivity, specificity and weighted accuracy. Holdout, i.e. the splitting of the development data into training and validation data sets just once, was used to find a model with as good performance as possible. 10-fold cross-validation was also performed to provide uncertainty estimates. For the transfer networks which were obtained with holdout, Gradient-weighted Class Activation Mapping was used to generate heat maps indicating which part of the image contributed to the network’s decision. For 10-fold cross-validation it was possible to achieve a mean AUC of 92% and mean sensitivity of 95% for the transfer network based on Xception when testing on the first data set. When testing on the second data set it was possible to obtain a mean AUC of 75% and mean sensitivity of 86% for the combination of ResNet50V2 and Xception.


Breast cancer is the most common type of cancer globally. Early detection is important for reducing the morbidity and mortality of breast cancer. The aim of this study was to evaluate the performance of different machine learning models to classify malignant or benign lesions on breast ultrasound images. Three different convolutional neural network approaches were implemented: (a) Simple convolutional neural network, (b) transfer learning using pre-trained InceptionV3, ResNet50V2, VGG19 and Xception and (c) deep feature networks based on combinations of the four transfer networks in (b). The data consisted of two breast ultrasound image data sets: (1) an open, single-vendor, data set collected by Cairo University at Baheya Hospital, Egypt, consisting of 437 benign lesions and 210 malignant lesions, where 10% was set to be a test set and the rest was used for training and validation (development) and (2) An in-house, multi-vendor data set collected at Unilabs Mammography Unit, Skåne University Hospital, Sweden, consisting of 13 benign lesions and 265 malignant lesions, was used as an external test set. Both test sets were used for evaluating the networks. The performance measures used were area under the receiver operating characteristic curve (AUC), sensitivity, specificity and weighted accuracy. Holdout, i.e. the splitting of the development data into training and validation data sets just once, was used to find a model with as good performance as possible. 10-fold cross-validation was also performed to provide uncertainty estimates. For the transfer networks which were obtained with holdout, Gradient-weighted Class Activation Mapping was used to generate heat maps indicating which part of the image contributed to the network’s decision. For 10-fold cross-validation it was possible to achieve a mean AUC of 92% and mean sensitivity of 95% for the transfer network based on Xception when testing on the first data set. When testing on the second data set it was possible to obtain a mean AUC of 75% and mean sensitivity of 86% for the combination of ResNet50V2 and Xception.
Keywords:
Breast Ultrasound Images ; Breast Cancer ; Convolutional Neural Networks ; Transfer Learning ; Mathematics ; Medical Image Processing
ISBN:
9781510649422
LUP-ID:
8eb4e016-c21d-40ba-adde-7aab20ab6900 | Link: https://lup.lub.lu.se/record/8eb4e016-c21d-40ba-adde-7aab20ab6900 | Statistics

Cite this