External validation of mathematical models to distinguish between benign and malignant adnexal tumors: A multicenter study by the International Ovarian Tumor Analysis group
(2007) In Clinical Cancer Research 13(15). p.4440-4447- Abstract
- Purpose: Several scoring systems have been developed to distinguish between benign and malignant adnexal tumors. However, few of them have been externally validated in new populations. Our aim was to compare their performance on a prospectively collected large multicenter data set. Experimental Design: In phase I of the International Ovarian Tumor Analysis multicenter study, patients with a persistent adnexal mass were examined with transvaginal ultrasound and color Doppler imaging. More than 50 end point variables were prospectively recorded for analysis. The outcome measure was the histologic classification of excised tissue as malignant or benign. We used the International Ovarian Tumor Analysis data to test the accuracy of previously... (More)
- Purpose: Several scoring systems have been developed to distinguish between benign and malignant adnexal tumors. However, few of them have been externally validated in new populations. Our aim was to compare their performance on a prospectively collected large multicenter data set. Experimental Design: In phase I of the International Ovarian Tumor Analysis multicenter study, patients with a persistent adnexal mass were examined with transvaginal ultrasound and color Doppler imaging. More than 50 end point variables were prospectively recorded for analysis. The outcome measure was the histologic classification of excised tissue as malignant or benign. We used the International Ovarian Tumor Analysis data to test the accuracy of previously published scoring systems. Receiver operating characteristic curves were constructed to compare the performance of the models. Results: Data from 1,066 patients were included; 800 patients (75%) had benign tumors and 266 patients (25%) had malignant tumors. The morphologic scoring system used by Lerner gave an area under the receiver operating characteristic curve (AUC) of 0.68, whereas the multimodal risk of malignancy index used by Jacobs gave an AUC of 0.88. The corresponding values for logistic regression and artificial neural network models varied between 0.76 and 0.91 and between 0.87 and 0.90, respectively. Advanced kernel-based classifiers gave an AUC of up to 0.92. Conclusion: The performance of the risk of malignancy index was similar to that of most logistic regression and artificial neural network models. The best result was obtained with a relevance vector machine with radial basis function kernel. Because the models were tested on a large multicenter data set, results are likely to be generally applicable. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/692771
- author
- organization
- publishing date
- 2007
- type
- Contribution to journal
- publication status
- published
- subject
- in
- Clinical Cancer Research
- volume
- 13
- issue
- 15
- pages
- 4440 - 4447
- publisher
- American Association for Cancer Research
- external identifiers
-
- wos:000248525100021
- scopus:34547668779
- ISSN
- 1078-0432
- DOI
- 10.1158/1078-0432.CCR-06-2958
- language
- English
- LU publication?
- yes
- id
- 703ccc0c-5b33-4272-bc50-f65b78bf9838 (old id 692771)
- date added to LUP
- 2016-04-01 11:52:55
- date last changed
- 2022-01-26 19:37:13
@article{703ccc0c-5b33-4272-bc50-f65b78bf9838, abstract = {{Purpose: Several scoring systems have been developed to distinguish between benign and malignant adnexal tumors. However, few of them have been externally validated in new populations. Our aim was to compare their performance on a prospectively collected large multicenter data set. Experimental Design: In phase I of the International Ovarian Tumor Analysis multicenter study, patients with a persistent adnexal mass were examined with transvaginal ultrasound and color Doppler imaging. More than 50 end point variables were prospectively recorded for analysis. The outcome measure was the histologic classification of excised tissue as malignant or benign. We used the International Ovarian Tumor Analysis data to test the accuracy of previously published scoring systems. Receiver operating characteristic curves were constructed to compare the performance of the models. Results: Data from 1,066 patients were included; 800 patients (75%) had benign tumors and 266 patients (25%) had malignant tumors. The morphologic scoring system used by Lerner gave an area under the receiver operating characteristic curve (AUC) of 0.68, whereas the multimodal risk of malignancy index used by Jacobs gave an AUC of 0.88. The corresponding values for logistic regression and artificial neural network models varied between 0.76 and 0.91 and between 0.87 and 0.90, respectively. Advanced kernel-based classifiers gave an AUC of up to 0.92. Conclusion: The performance of the risk of malignancy index was similar to that of most logistic regression and artificial neural network models. The best result was obtained with a relevance vector machine with radial basis function kernel. Because the models were tested on a large multicenter data set, results are likely to be generally applicable.}}, author = {{Van Holsbeke, Caroline and Van Calster, Ben and Valentin, Lil and Testa, Antonia C. and Ferrazzi, Enrico and Dimou, Ioannis and Lu, Chuan and Moerman, Philippe and Van Huffel, Sabine and Vergote, Ignace and Timmerman, Dirk}}, issn = {{1078-0432}}, language = {{eng}}, number = {{15}}, pages = {{4440--4447}}, publisher = {{American Association for Cancer Research}}, series = {{Clinical Cancer Research}}, title = {{External validation of mathematical models to distinguish between benign and malignant adnexal tumors: A multicenter study by the International Ovarian Tumor Analysis group}}, url = {{http://dx.doi.org/10.1158/1078-0432.CCR-06-2958}}, doi = {{10.1158/1078-0432.CCR-06-2958}}, volume = {{13}}, year = {{2007}}, }