Advanced

International Endometrial Tumor Analysis (IETA) terminology in women with postmenopausal bleeding and sonographic endometrial thickness ≥ 4.5 mm : agreement and reliability study

Sladkevicius, P. LU ; Installé, A.; Van Den Bosch, T.; Timmerman, D.; Benacerraf, B.; Jokubkiene, L. LU ; Di Legge, A.; Votino, A.; Zannoni, L. and De Moor, B., et al. (2018) In Ultrasound in Obstetrics and Gynecology 51(2). p.259-268
Abstract

Objective: To estimate intra- and interrater agreement and reliability with regard to describing ultrasound images of the endometrium using the International Endometrial Tumor Analysis (IETA) terminology. Methods: Four expert and four non-expert raters assessed videoclips of transvaginal ultrasound examinations of the endometrium obtained from 99 women with postmenopausal bleeding and sonographic endometrial thickness ≥ 4.5 mm but without fluid in the uterine cavity. The following features were rated: endometrial echogenicity, endometrial midline, bright edge, endometrial–myometrial junction, color score, vascular pattern, irregularly branching vessels and color splashes. The color content of the endometrial scan was estimated using a... (More)

Objective: To estimate intra- and interrater agreement and reliability with regard to describing ultrasound images of the endometrium using the International Endometrial Tumor Analysis (IETA) terminology. Methods: Four expert and four non-expert raters assessed videoclips of transvaginal ultrasound examinations of the endometrium obtained from 99 women with postmenopausal bleeding and sonographic endometrial thickness ≥ 4.5 mm but without fluid in the uterine cavity. The following features were rated: endometrial echogenicity, endometrial midline, bright edge, endometrial–myometrial junction, color score, vascular pattern, irregularly branching vessels and color splashes. The color content of the endometrial scan was estimated using a visual analog scale graded from 0 to 100. To estimate intrarater agreement and reliability, the same videoclips were assessed twice with a minimum of 2 months' interval. The raters were blinded to their own results and to those of the other raters. Results: Interrater differences in the described prevalence of most IETA variables were substantial, and some variable categories were observed rarely. Specific agreement was poor for variables with many categories. For binary variables, specific agreement was better for absence than for presence of a category. For variables with more than two outcome categories, specific agreement for expert and non-expert raters was best for not-defined endometrial midline (93% and 96%), regular endometrial–myometrial junction (72% and 70%) and three-layer endometrial pattern (67% and 56%). The grayscale ultrasound variable with the best reliability was uniform vs non-uniform echogenicity (multirater kappa (κ), 0.55 for expert and 0.52 for non-expert raters), and the variables with the lowest reliability were appearance of the endometrial–myometrial junction (κ, 0.25 and 0.16) and the nine-category endometrial echogenicity variable (κ, 0.29 and 0.28). The most reliable color Doppler variable was color score (mean weighted κ, 0.77 and 0.69). Intra- and interrater agreement and reliability were similar for experts and non-experts. Conclusions: Inter- and intrarater agreement and reliability when using IETA terminology were limited. This may have implications when assessing the association between a particular ultrasound feature and a specific histological diagnosis, because lack of reproducibility reduces the reliability of the association between a feature and the outcome. Future studies should investigate whether using fewer categories of variable or offering practical training could improve agreement and reliability.

(Less)
Please use this url to cite or link to this publication:
author
, et al. (More)
(Less)
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Doppler ultrasonography, endometrium, observer variation, reproducibility of results, ultrasonography
in
Ultrasound in Obstetrics and Gynecology
volume
51
issue
2
pages
10 pages
publisher
John Wiley & Sons
external identifiers
  • scopus:85041647464
ISSN
0960-7692
DOI
10.1002/uog.18813
language
English
LU publication?
yes
id
fe5de966-aeb6-47ae-8a69-f7b9906173c4
date added to LUP
2018-03-06 13:19:13
date last changed
2018-05-29 10:58:51
@article{fe5de966-aeb6-47ae-8a69-f7b9906173c4,
  abstract     = {<p>Objective: To estimate intra- and interrater agreement and reliability with regard to describing ultrasound images of the endometrium using the International Endometrial Tumor Analysis (IETA) terminology. Methods: Four expert and four non-expert raters assessed videoclips of transvaginal ultrasound examinations of the endometrium obtained from 99 women with postmenopausal bleeding and sonographic endometrial thickness ≥ 4.5 mm but without fluid in the uterine cavity. The following features were rated: endometrial echogenicity, endometrial midline, bright edge, endometrial–myometrial junction, color score, vascular pattern, irregularly branching vessels and color splashes. The color content of the endometrial scan was estimated using a visual analog scale graded from 0 to 100. To estimate intrarater agreement and reliability, the same videoclips were assessed twice with a minimum of 2 months' interval. The raters were blinded to their own results and to those of the other raters. Results: Interrater differences in the described prevalence of most IETA variables were substantial, and some variable categories were observed rarely. Specific agreement was poor for variables with many categories. For binary variables, specific agreement was better for absence than for presence of a category. For variables with more than two outcome categories, specific agreement for expert and non-expert raters was best for not-defined endometrial midline (93% and 96%), regular endometrial–myometrial junction (72% and 70%) and three-layer endometrial pattern (67% and 56%). The grayscale ultrasound variable with the best reliability was uniform vs non-uniform echogenicity (multirater kappa (κ), 0.55 for expert and 0.52 for non-expert raters), and the variables with the lowest reliability were appearance of the endometrial–myometrial junction (κ, 0.25 and 0.16) and the nine-category endometrial echogenicity variable (κ, 0.29 and 0.28). The most reliable color Doppler variable was color score (mean weighted κ, 0.77 and 0.69). Intra- and interrater agreement and reliability were similar for experts and non-experts. Conclusions: Inter- and intrarater agreement and reliability when using IETA terminology were limited. This may have implications when assessing the association between a particular ultrasound feature and a specific histological diagnosis, because lack of reproducibility reduces the reliability of the association between a feature and the outcome. Future studies should investigate whether using fewer categories of variable or offering practical training could improve agreement and reliability.</p>},
  author       = {Sladkevicius, P. and Installé, A. and Van Den Bosch, T. and Timmerman, D. and Benacerraf, B. and Jokubkiene, L. and Di Legge, A. and Votino, A. and Zannoni, L. and De Moor, B. and De Cock, B. and Van Calster, B. and Valentin, L.},
  issn         = {0960-7692},
  keyword      = {Doppler ultrasonography,endometrium,observer variation,reproducibility of results,ultrasonography},
  language     = {eng},
  month        = {02},
  number       = {2},
  pages        = {259--268},
  publisher    = {John Wiley & Sons},
  series       = {Ultrasound in Obstetrics and Gynecology},
  title        = {International Endometrial Tumor Analysis (IETA) terminology in women with postmenopausal bleeding and sonographic endometrial thickness ≥ 4.5 mm : agreement and reliability study},
  url          = {http://dx.doi.org/10.1002/uog.18813},
  volume       = {51},
  year         = {2018},
}