Comparison of 'pattern recognition' and logistic regression models for discrimination between benign and malignant pelvic masses: a prospective cross validation

Valentin, Lil; Hagen, B; Tingulstad, S; Eik-Nes, S

Comparison of 'pattern recognition' and logistic regression models for discrimination between benign and malignant pelvic masses: a prospective cross validation

Mark

Valentin, Lil ^LU

; Hagen, B ; Tingulstad, S and Eik-Nes, S (2001) In Ultrasound in Obstetrics & Gynecology 18(4). p.357-365

Abstract: OBJECTIVES: To test prospectively the diagnostic performance of two logistic regression models for calculation of individual risk of malignancy in adnexal tumors (the 'Tailor model' and the 'Timmerman model'), and to compare them to that of 'pattern recognition' (subjective evaluation of the gray-scale ultrasound image and color Doppler ultrasound examination). DESIGN: Consecutive women with a pelvic mass judged clinically to be of adnexal origin underwent preoperative ultrasound examination including color and spectral Doppler examination. The same examination techniques and definitions as those used in the studies in which the logistic regression models had been created were used. The Tailor model was tested in 133 women (35 of whom hada... (More); OBJECTIVES: To test prospectively the diagnostic performance of two logistic regression models for calculation of individual risk of malignancy in adnexal tumors (the 'Tailor model' and the 'Timmerman model'), and to compare them to that of 'pattern recognition' (subjective evaluation of the gray-scale ultrasound image and color Doppler ultrasound examination). DESIGN: Consecutive women with a pelvic mass judged clinically to be of adnexal origin underwent preoperative ultrasound examination including color and spectral Doppler examination. The same examination techniques and definitions as those used in the studies in which the logistic regression models had been created were used. The Tailor model was tested in 133 women (35 of whom hada malignancy) and the Timmerman model in 82 women (29 of whom had a malignancy). A subset of 79 women (28 of whom had a malignancy) was used to compare the performance of the Tailor model and the Timmerman model by calculating and comparing the areas under the receiver operating characteristics curves of the two models. Sensitivity and specificity with regard to malignancy were calculated for all three methods. RESULTS: Pattern recognition performed better than the two logistic regression models (sensitivity around 85%, specificity around 90%). Using a risk of malignancy of > 50% to indicate malignancy (as suggested in the original publications), the sensitivity of the Tailor model was 69% and the specificity 88% (n = 133). The corresponding values for the Timmerman model were 62% and 79% (n = 82). The receiver operating characteristics curves showed the two logistic regression models to have similar diagnostic properties (area under the curve, 0.87 vs. 0.84; P = 0.25; n = 79). The diagnostic performance of the mathematical models was much poorer in this study than in those in which the models had been created. CONCLUSION: The poor diagnostic performance of the mathematical models can probably be explained by subtle differences in definitions and examination technique and by differences between the original tumor populations and the study population. For mathematical models to be generally useful, they probably need to be created on the basis of a very large number of tumors, and the variables in the model must be unequivocally defined and the examination technique meticulously standardized. (Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/1120562

author

Valentin, Lil ^LU

; Hagen, B ; Tingulstad, S and Eik-Nes, S

organization

Obstetric, Gynaecological and Prenatal Ultrasound Research (research group)

publishing date

2001

type

Contribution to journal

publication status

published

subject

Radiology and Medical Imaging

keywords

Doppler ultrasound, Multiple logistic regression model, Ovarian cancer, Ovarian tumor, Pattern recognition, Pelvic tumor, Ultrasound

in

Ultrasound in Obstetrics & Gynecology

volume

18

issue

4

pages

357 - 365

publisher

John Wiley & Sons Inc.

external identifiers

pmid:11778996
scopus:0034774279
pmid:11778996

ISSN

1469-0705

DOI

10.1046/j.0960-7692.2001.00500.x

language

English

LU publication?

yes

id

8ce0442a-4d13-44f8-95a8-45818b3b9bcd (old id 1120562)

alternative location

http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=11778996

date added to LUP

2016-04-01 15:40:31

date last changed

2025-04-04 14:07:17

@article{8ce0442a-4d13-44f8-95a8-45818b3b9bcd,
  abstract     = {{OBJECTIVES: To test prospectively the diagnostic performance of two logistic regression models for calculation of individual risk of malignancy in adnexal tumors (the 'Tailor model' and the 'Timmerman model'), and to compare them to that of 'pattern recognition' (subjective evaluation of the gray-scale ultrasound image and color Doppler ultrasound examination). DESIGN: Consecutive women with a pelvic mass judged clinically to be of adnexal origin underwent preoperative ultrasound examination including color and spectral Doppler examination. The same examination techniques and definitions as those used in the studies in which the logistic regression models had been created were used. The Tailor model was tested in 133 women (35 of whom hada malignancy) and the Timmerman model in 82 women (29 of whom had a malignancy). A subset of 79 women (28 of whom had a malignancy) was used to compare the performance of the Tailor model and the Timmerman model by calculating and comparing the areas under the receiver operating characteristics curves of the two models. Sensitivity and specificity with regard to malignancy were calculated for all three methods. RESULTS: Pattern recognition performed better than the two logistic regression models (sensitivity around 85%, specificity around 90%). Using a risk of malignancy of &gt; 50% to indicate malignancy (as suggested in the original publications), the sensitivity of the Tailor model was 69% and the specificity 88% (n = 133). The corresponding values for the Timmerman model were 62% and 79% (n = 82). The receiver operating characteristics curves showed the two logistic regression models to have similar diagnostic properties (area under the curve, 0.87 vs. 0.84; P = 0.25; n = 79). The diagnostic performance of the mathematical models was much poorer in this study than in those in which the models had been created. CONCLUSION: The poor diagnostic performance of the mathematical models can probably be explained by subtle differences in definitions and examination technique and by differences between the original tumor populations and the study population. For mathematical models to be generally useful, they probably need to be created on the basis of a very large number of tumors, and the variables in the model must be unequivocally defined and the examination technique meticulously standardized.}},
  author       = {{Valentin, Lil and Hagen, B and Tingulstad, S and Eik-Nes, S}},
  issn         = {{1469-0705}},
  keywords     = {{Doppler ultrasound; Multiple logistic regression model; Ovarian cancer; Ovarian tumor; Pattern recognition; Pelvic tumor; Ultrasound}},
  language     = {{eng}},
  number       = {{4}},
  pages        = {{357--365}},
  publisher    = {{John Wiley & Sons Inc.}},
  series       = {{Ultrasound in Obstetrics & Gynecology}},
  title        = {{Comparison of 'pattern recognition' and logistic regression models for discrimination between benign and malignant pelvic masses: a prospective cross validation}},
  url          = {{http://dx.doi.org/10.1046/j.0960-7692.2001.00500.x}},
  doi          = {{10.1046/j.0960-7692.2001.00500.x}},
  volume       = {{18}},
  year         = {{2001}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Comparison of 'pattern recognition' and logistic regression models for discrimination between benign and malignant pelvic masses: a prospective cross validation