Comparison of 'pattern recognition' and logistic regression models for discrimination between benign and malignant pelvic masses: a prospective cross validation
(2001) In Ultrasound in Obstetrics & Gynecology 18(4). p.357-365- Abstract
- OBJECTIVES: To test prospectively the diagnostic performance of two logistic regression models for calculation of individual risk of malignancy in adnexal tumors (the 'Tailor model' and the 'Timmerman model'), and to compare them to that of 'pattern recognition' (subjective evaluation of the gray-scale ultrasound image and color Doppler ultrasound examination). DESIGN: Consecutive women with a pelvic mass judged clinically to be of adnexal origin underwent preoperative ultrasound examination including color and spectral Doppler examination. The same examination techniques and definitions as those used in the studies in which the logistic regression models had been created were used. The Tailor model was tested in 133 women (35 of whom hada... (More)
- OBJECTIVES: To test prospectively the diagnostic performance of two logistic regression models for calculation of individual risk of malignancy in adnexal tumors (the 'Tailor model' and the 'Timmerman model'), and to compare them to that of 'pattern recognition' (subjective evaluation of the gray-scale ultrasound image and color Doppler ultrasound examination). DESIGN: Consecutive women with a pelvic mass judged clinically to be of adnexal origin underwent preoperative ultrasound examination including color and spectral Doppler examination. The same examination techniques and definitions as those used in the studies in which the logistic regression models had been created were used. The Tailor model was tested in 133 women (35 of whom hada malignancy) and the Timmerman model in 82 women (29 of whom had a malignancy). A subset of 79 women (28 of whom had a malignancy) was used to compare the performance of the Tailor model and the Timmerman model by calculating and comparing the areas under the receiver operating characteristics curves of the two models. Sensitivity and specificity with regard to malignancy were calculated for all three methods. RESULTS: Pattern recognition performed better than the two logistic regression models (sensitivity around 85%, specificity around 90%). Using a risk of malignancy of > 50% to indicate malignancy (as suggested in the original publications), the sensitivity of the Tailor model was 69% and the specificity 88% (n = 133). The corresponding values for the Timmerman model were 62% and 79% (n = 82). The receiver operating characteristics curves showed the two logistic regression models to have similar diagnostic properties (area under the curve, 0.87 vs. 0.84; P = 0.25; n = 79). The diagnostic performance of the mathematical models was much poorer in this study than in those in which the models had been created. CONCLUSION: The poor diagnostic performance of the mathematical models can probably be explained by subtle differences in definitions and examination technique and by differences between the original tumor populations and the study population. For mathematical models to be generally useful, they probably need to be created on the basis of a very large number of tumors, and the variables in the model must be unequivocally defined and the examination technique meticulously standardized. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/1120562
- author
- Valentin, Lil
LU
; Hagen, B ; Tingulstad, S and Eik-Nes, S
- organization
- publishing date
- 2001
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- Doppler ultrasound, Multiple logistic regression model, Ovarian cancer, Ovarian tumor, Pattern recognition, Pelvic tumor, Ultrasound
- in
- Ultrasound in Obstetrics & Gynecology
- volume
- 18
- issue
- 4
- pages
- 357 - 365
- publisher
- John Wiley & Sons Inc.
- external identifiers
-
- pmid:11778996
- scopus:0034774279
- pmid:11778996
- ISSN
- 1469-0705
- DOI
- 10.1046/j.0960-7692.2001.00500.x
- language
- English
- LU publication?
- yes
- id
- 8ce0442a-4d13-44f8-95a8-45818b3b9bcd (old id 1120562)
- alternative location
- http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=11778996
- date added to LUP
- 2016-04-01 15:40:31
- date last changed
- 2025-04-04 14:07:17
@article{8ce0442a-4d13-44f8-95a8-45818b3b9bcd, abstract = {{OBJECTIVES: To test prospectively the diagnostic performance of two logistic regression models for calculation of individual risk of malignancy in adnexal tumors (the 'Tailor model' and the 'Timmerman model'), and to compare them to that of 'pattern recognition' (subjective evaluation of the gray-scale ultrasound image and color Doppler ultrasound examination). DESIGN: Consecutive women with a pelvic mass judged clinically to be of adnexal origin underwent preoperative ultrasound examination including color and spectral Doppler examination. The same examination techniques and definitions as those used in the studies in which the logistic regression models had been created were used. The Tailor model was tested in 133 women (35 of whom hada malignancy) and the Timmerman model in 82 women (29 of whom had a malignancy). A subset of 79 women (28 of whom had a malignancy) was used to compare the performance of the Tailor model and the Timmerman model by calculating and comparing the areas under the receiver operating characteristics curves of the two models. Sensitivity and specificity with regard to malignancy were calculated for all three methods. RESULTS: Pattern recognition performed better than the two logistic regression models (sensitivity around 85%, specificity around 90%). Using a risk of malignancy of > 50% to indicate malignancy (as suggested in the original publications), the sensitivity of the Tailor model was 69% and the specificity 88% (n = 133). The corresponding values for the Timmerman model were 62% and 79% (n = 82). The receiver operating characteristics curves showed the two logistic regression models to have similar diagnostic properties (area under the curve, 0.87 vs. 0.84; P = 0.25; n = 79). The diagnostic performance of the mathematical models was much poorer in this study than in those in which the models had been created. CONCLUSION: The poor diagnostic performance of the mathematical models can probably be explained by subtle differences in definitions and examination technique and by differences between the original tumor populations and the study population. For mathematical models to be generally useful, they probably need to be created on the basis of a very large number of tumors, and the variables in the model must be unequivocally defined and the examination technique meticulously standardized.}}, author = {{Valentin, Lil and Hagen, B and Tingulstad, S and Eik-Nes, S}}, issn = {{1469-0705}}, keywords = {{Doppler ultrasound; Multiple logistic regression model; Ovarian cancer; Ovarian tumor; Pattern recognition; Pelvic tumor; Ultrasound}}, language = {{eng}}, number = {{4}}, pages = {{357--365}}, publisher = {{John Wiley & Sons Inc.}}, series = {{Ultrasound in Obstetrics & Gynecology}}, title = {{Comparison of 'pattern recognition' and logistic regression models for discrimination between benign and malignant pelvic masses: a prospective cross validation}}, url = {{http://dx.doi.org/10.1046/j.0960-7692.2001.00500.x}}, doi = {{10.1046/j.0960-7692.2001.00500.x}}, volume = {{18}}, year = {{2001}}, }