External validation of mathematical models to distinguish between benign and malignant adnexal tumors: A multicenter study by the International Ovarian Tumor Analysis group

Van Holsbeke, Caroline; Van Calster, Ben; Valentin, Lil; Testa, Antonia C.; Ferrazzi, Enrico; Dimou, Ioannis; Lu, Chuan; Moerman, Philippe; Van Huffel, Sabine; Vergote, Ignace; Timmerman, Dirk

External validation of mathematical models to distinguish between benign and malignant adnexal tumors: A multicenter study by the International Ovarian Tumor Analysis group

Mark

Van Holsbeke, Caroline ; Van Calster, Ben ; Valentin, Lil ^LU

; Testa, Antonia C. ; Ferrazzi, Enrico ; Dimou, Ioannis ; Lu, Chuan ; Moerman, Philippe ; Van Huffel, Sabine and Vergote, Ignace , et al. (2007) In Clinical Cancer Research 13(15). p.4440-4447

Abstract: Purpose: Several scoring systems have been developed to distinguish between benign and malignant adnexal tumors. However, few of them have been externally validated in new populations. Our aim was to compare their performance on a prospectively collected large multicenter data set. Experimental Design: In phase I of the International Ovarian Tumor Analysis multicenter study, patients with a persistent adnexal mass were examined with transvaginal ultrasound and color Doppler imaging. More than 50 end point variables were prospectively recorded for analysis. The outcome measure was the histologic classification of excised tissue as malignant or benign. We used the International Ovarian Tumor Analysis data to test the accuracy of previously... (More); Purpose: Several scoring systems have been developed to distinguish between benign and malignant adnexal tumors. However, few of them have been externally validated in new populations. Our aim was to compare their performance on a prospectively collected large multicenter data set. Experimental Design: In phase I of the International Ovarian Tumor Analysis multicenter study, patients with a persistent adnexal mass were examined with transvaginal ultrasound and color Doppler imaging. More than 50 end point variables were prospectively recorded for analysis. The outcome measure was the histologic classification of excised tissue as malignant or benign. We used the International Ovarian Tumor Analysis data to test the accuracy of previously published scoring systems. Receiver operating characteristic curves were constructed to compare the performance of the models. Results: Data from 1,066 patients were included; 800 patients (75%) had benign tumors and 266 patients (25%) had malignant tumors. The morphologic scoring system used by Lerner gave an area under the receiver operating characteristic curve (AUC) of 0.68, whereas the multimodal risk of malignancy index used by Jacobs gave an AUC of 0.88. The corresponding values for logistic regression and artificial neural network models varied between 0.76 and 0.91 and between 0.87 and 0.90, respectively. Advanced kernel-based classifiers gave an AUC of up to 0.92. Conclusion: The performance of the risk of malignancy index was similar to that of most logistic regression and artificial neural network models. The best result was obtained with a relevance vector machine with radial basis function kernel. Because the models were tested on a large multicenter data set, results are likely to be generally applicable. (Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/692771

author

Van Holsbeke, Caroline ; Van Calster, Ben ; Valentin, Lil ^LU

; Testa, Antonia C. ; Ferrazzi, Enrico ; Dimou, Ioannis ; Lu, Chuan ; Moerman, Philippe ; Van Huffel, Sabine and Vergote, Ignace , et al. (More)

Van Holsbeke, Caroline ; Van Calster, Ben ; Valentin, Lil ^LU

; Testa, Antonia C. ; Ferrazzi, Enrico ; Dimou, Ioannis ; Lu, Chuan ; Moerman, Philippe ; Van Huffel, Sabine ; Vergote, Ignace and Timmerman, Dirk (Less)

organization

Obstetric, Gynaecological and Prenatal Ultrasound Research (research group)

publishing date

2007

type

Contribution to journal

publication status

published

subject

Cancer and Oncology

in

Clinical Cancer Research

volume

13

issue

15

pages

4440 - 4447

publisher

American Association for Cancer Research Inc.

external identifiers

wos:000248525100021
scopus:34547668779

ISSN

1078-0432

DOI

10.1158/1078-0432.CCR-06-2958

language

English

LU publication?

yes

id

703ccc0c-5b33-4272-bc50-f65b78bf9838 (old id 692771)

date added to LUP

2016-04-01 11:52:55

date last changed

2025-10-14 11:49:43

@article{703ccc0c-5b33-4272-bc50-f65b78bf9838,
  abstract     = {{Purpose: Several scoring systems have been developed to distinguish between benign and malignant adnexal tumors. However, few of them have been externally validated in new populations. Our aim was to compare their performance on a prospectively collected large multicenter data set. Experimental Design: In phase I of the International Ovarian Tumor Analysis multicenter study, patients with a persistent adnexal mass were examined with transvaginal ultrasound and color Doppler imaging. More than 50 end point variables were prospectively recorded for analysis. The outcome measure was the histologic classification of excised tissue as malignant or benign. We used the International Ovarian Tumor Analysis data to test the accuracy of previously published scoring systems. Receiver operating characteristic curves were constructed to compare the performance of the models. Results: Data from 1,066 patients were included; 800 patients (75%) had benign tumors and 266 patients (25%) had malignant tumors. The morphologic scoring system used by Lerner gave an area under the receiver operating characteristic curve (AUC) of 0.68, whereas the multimodal risk of malignancy index used by Jacobs gave an AUC of 0.88. The corresponding values for logistic regression and artificial neural network models varied between 0.76 and 0.91 and between 0.87 and 0.90, respectively. Advanced kernel-based classifiers gave an AUC of up to 0.92. Conclusion: The performance of the risk of malignancy index was similar to that of most logistic regression and artificial neural network models. The best result was obtained with a relevance vector machine with radial basis function kernel. Because the models were tested on a large multicenter data set, results are likely to be generally applicable.}},
  author       = {{Van Holsbeke, Caroline and Van Calster, Ben and Valentin, Lil and Testa, Antonia C. and Ferrazzi, Enrico and Dimou, Ioannis and Lu, Chuan and Moerman, Philippe and Van Huffel, Sabine and Vergote, Ignace and Timmerman, Dirk}},
  issn         = {{1078-0432}},
  language     = {{eng}},
  number       = {{15}},
  pages        = {{4440--4447}},
  publisher    = {{American Association for Cancer Research Inc.}},
  series       = {{Clinical Cancer Research}},
  title        = {{External validation of mathematical models to distinguish between benign and malignant adnexal tumors: A multicenter study by the International Ovarian Tumor Analysis group}},
  url          = {{http://dx.doi.org/10.1158/1078-0432.CCR-06-2958}},
  doi          = {{10.1158/1078-0432.CCR-06-2958}},
  volume       = {{13}},
  year         = {{2007}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

External validation of mathematical models to distinguish between benign and malignant adnexal tumors: A multicenter study by the International Ovarian Tumor Analysis group