Abstract P2-03-01: Analytical validation of a standardized scoring protocol for Ki67 assessed on breast excision whole sections: An international multicenter collaboration

Nielsen, Torsten O; Leung, Samuel CY; Zabaglo, LA; Arun, I; Badve, SS; Bane, AL; Bartlet, JMS; Borgquist, Signe; Chang, MC; Dodson, A; Ehinger, Anna; Fineberg, S; Focke, CM; Gao, D; Gown, Allen M; Gutierrez, Carolina; Hugh, JC; Kos, Z; Lænkholm, Anne-Vibeke; Mastropasqua, Mauro G; Moriya, Takuya; Nofech-Mozes, S; Osborne, CK; Penault-Llorca, Frédérique M; Piper, Tammy; Sakatani, Takashi; Salgado, Roberto; Starczynski, Jane; Sugie, T; van der Vegt, B; Viale, Giuseppe; Hayes, Daniel F; McShane, Lisa M; Dowsett, Mitch

Abstract P2-03-01: Analytical validation of a standardized scoring protocol for Ki67 assessed on breast excision whole sections: An international multicenter collaboration

Mark

Nielsen, Torsten O ; Leung, Samuel CY ; Zabaglo, LA ; Arun, I ; Badve, SS ; Bane, AL ; Bartlet, JMS ; Borgquist, Signe ^LU ; Chang, MC and Dodson, A , et al. (2018) San Antonio Breast Cancer Symposium, 2017 In Cancer research. Supplement 78(4).

Abstract: Aims: (i) Determine whether between-observer reproducibility for Ki67 when assessed on whole sections according to a standardized scoring protocol is adequate for clinical application. (ii) Compare between-observer reproducibility of Ki67 scores assessed on hot-spots to scores using a global method that averages across a tissue section.

Background: The nuclear proliferation biomarker Ki67 has multiple potential roles in breast cancer, including aiding decisions based on prognosis, but unacceptable levels of between-laboratory variability have been observed. The International Ki67 in Breast Cancer Working Group has undertaken a systematic program to determine whether Ki67 measurement can be analytically validated and standardized... (More); Aims: (i) Determine whether between-observer reproducibility for Ki67 when assessed on whole sections according to a standardized scoring protocol is adequate for clinical application. (ii) Compare between-observer reproducibility of Ki67 scores assessed on hot-spots to scores using a global method that averages across a tissue section.

Background: The nuclear proliferation biomarker Ki67 has multiple potential roles in breast cancer, including aiding decisions based on prognosis, but unacceptable levels of between-laboratory variability have been observed. The International Ki67 in Breast Cancer Working Group has undertaken a systematic program to determine whether Ki67 measurement can be analytically validated and standardized across labs. In phase 1, variability in visual interpretation was identified as an important source of variability. Phases 2 and 3a showed that adherence to defined scoring methods substantially improved reproducibility in scoring tissue microarrays and core-cut biopsies. We now assess whether acceptable reproducibility can be achieved on whole sections.

Methods: Adjacent sections from 30 primary ER+ breast cancers were centrally stained for Ki67 to assemble 4 sets of 30 stained tumor sections, circulated around 23 labs in 12 countries. Ki67 was scored by 2 methods by all labs: (a) global: 4 fields of 100 tumor cells each were selected to reflect observed heterogeneity in nuclear staining (b) hot-spot: the field with highest Ki67 percentage of tumor cells with nuclear staining was selected and up to 500 cells scored. Ki67 scores were log2-transformed for statistical analyses and back-transformed for presentation. The primary objective was to assess whether either method could achieve an intraclass correlation coefficient (ICC) significantly greater than 0.8, considered substantial to almost-perfect reproducibility. Secondary objectives were to assess which method had highest observed ICC and to assess whether observers identified the same “hot-spots”.

Results: ICC for the global method was 0.87 (95%CI: 0.799-0.93), marginally meeting the prespecified success criterion. The ICC for the hot-spot method was 0.83 (95%CI: 0.74-0.90) and had a CI extending below the success criterion. Across the 23 labs, geometric mean value of the 30 scores ranged from 8.5 to 19.6 for the global method and from 12.8 to 30.3 for the hot-spot method. The overall mean (95% CI) of these values was 12.9 (11.9-14.0) and 20.9 (19.1-22.8), respectively. Visually, between-laboratory agreement in location of selected hot-spot varies between cases. The median times for scoring were 9 and 6 minutes for global and hot-spot methods respectively.

Conclusions: The global method marginally met the prespecified criterion of success; it should now be evaluated for clinical validity in appropriate cohorts of cases. The hot-spot method was observed to have slightly less reproducibility between labs. The time taken for scoring by either method is practical using counting software we are making publicly available. Establishment of external quality assessment schemes is likely to improve the reproducibility between labs further
(Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/8d85a0ef-fa0c-4b61-b7be-771442a59736

author

Nielsen, Torsten O ; Leung, Samuel CY ; Zabaglo, LA ; Arun, I ; Badve, SS ; Bane, AL ; Bartlet, JMS ; Borgquist, Signe ^LU ; Chang, MC and Dodson, A , et al. (More)

Nielsen, Torsten O ; Leung, Samuel CY ; Zabaglo, LA ; Arun, I ; Badve, SS ; Bane, AL ; Bartlet, JMS ; Borgquist, Signe ^LU ; Chang, MC ; Dodson, A ; Ehinger, Anna ^LU

; Fineberg, S ; Focke, CM ; Gao, D ; Gown, Allen M ; Gutierrez, Carolina ; Hugh, JC ; Kos, Z ; Lænkholm, Anne-Vibeke ; Mastropasqua, Mauro G ; Moriya, Takuya ; Nofech-Mozes, S ; Osborne, CK ; Penault-Llorca, Frédérique M ; Piper, Tammy ; Sakatani, Takashi ; Salgado, Roberto ; Starczynski, Jane ; Sugie, T ; van der Vegt, B ; Viale, Giuseppe ; Hayes, Daniel F ; McShane, Lisa M and Dowsett, Mitch (Less)

organization

publishing date

2018-02

type

Contribution to journal

publication status

published

subject

in

Cancer research. Supplement

volume

78

issue

4

publisher

American Association for Cancer Research Inc.

conference name

San Antonio Breast Cancer Symposium, 2017

conference location

San Antonio, United States

conference dates

2017-12-05 - 2017-12-09

ISSN

1538-7445

DOI

10.1158/1538-7445.SABCS17-P2-03-01

language

English

LU publication?

yes

id

8d85a0ef-fa0c-4b61-b7be-771442a59736

alternative location

http://cancerres.aacrjournals.org/content/78/4_Supplement/P2-03-01

date added to LUP

2018-03-05 17:40:42

date last changed

2025-04-04 14:02:06

@misc{8d85a0ef-fa0c-4b61-b7be-771442a59736,
  abstract     = {{Aims: (i) Determine whether between-observer reproducibility for Ki67 when assessed on whole sections according to a standardized scoring protocol is adequate for clinical application. (ii) Compare between-observer reproducibility of Ki67 scores assessed on hot-spots to scores using a global method that averages across a tissue section.<br/><br/>Background: The nuclear proliferation biomarker Ki67 has multiple potential roles in breast cancer, including aiding decisions based on prognosis, but unacceptable levels of between-laboratory variability have been observed. The International Ki67 in Breast Cancer Working Group has undertaken a systematic program to determine whether Ki67 measurement can be analytically validated and standardized across labs. In phase 1, variability in visual interpretation was identified as an important source of variability. Phases 2 and 3a showed that adherence to defined scoring methods substantially improved reproducibility in scoring tissue microarrays and core-cut biopsies. We now assess whether acceptable reproducibility can be achieved on whole sections.<br/><br/>Methods: Adjacent sections from 30 primary ER+ breast cancers were centrally stained for Ki67 to assemble 4 sets of 30 stained tumor sections, circulated around 23 labs in 12 countries. Ki67 was scored by 2 methods by all labs: (a) global: 4 fields of 100 tumor cells each were selected to reflect observed heterogeneity in nuclear staining (b) hot-spot: the field with highest Ki67 percentage of tumor cells with nuclear staining was selected and up to 500 cells scored. Ki67 scores were log2-transformed for statistical analyses and back-transformed for presentation. The primary objective was to assess whether either method could achieve an intraclass correlation coefficient (ICC) significantly greater than 0.8, considered substantial to almost-perfect reproducibility. Secondary objectives were to assess which method had highest observed ICC and to assess whether observers identified the same “hot-spots”.<br/><br/>Results: ICC for the global method was 0.87 (95%CI: 0.799-0.93), marginally meeting the prespecified success criterion. The ICC for the hot-spot method was 0.83 (95%CI: 0.74-0.90) and had a CI extending below the success criterion. Across the 23 labs, geometric mean value of the 30 scores ranged from 8.5 to 19.6 for the global method and from 12.8 to 30.3 for the hot-spot method. The overall mean (95% CI) of these values was 12.9 (11.9-14.0) and 20.9 (19.1-22.8), respectively. Visually, between-laboratory agreement in location of selected hot-spot varies between cases. The median times for scoring were 9 and 6 minutes for global and hot-spot methods respectively.<br/><br/>Conclusions: The global method marginally met the prespecified criterion of success; it should now be evaluated for clinical validity in appropriate cohorts of cases. The hot-spot method was observed to have slightly less reproducibility between labs. The time taken for scoring by either method is practical using counting software we are making publicly available. Establishment of external quality assessment schemes is likely to improve the reproducibility between labs further<br/>}},
  author       = {{Nielsen, Torsten O and Leung, Samuel CY and Zabaglo, LA and Arun, I and Badve, SS and Bane, AL and Bartlet, JMS and Borgquist, Signe and Chang, MC and Dodson, A and Ehinger, Anna and Fineberg, S and Focke, CM and Gao, D and Gown, Allen M and Gutierrez, Carolina and Hugh, JC and Kos, Z and Lænkholm, Anne-Vibeke and Mastropasqua, Mauro G and Moriya, Takuya and Nofech-Mozes, S and Osborne, CK and Penault-Llorca, Frédérique M and Piper, Tammy and Sakatani, Takashi and Salgado, Roberto and Starczynski, Jane and Sugie, T and van der Vegt, B and Viale, Giuseppe and Hayes, Daniel F and McShane, Lisa M and Dowsett, Mitch}},
  issn         = {{1538-7445}},
  language     = {{eng}},
  note         = {{Conference Abstract}},
  number       = {{4}},
  publisher    = {{American Association for Cancer Research Inc.}},
  series       = {{Cancer research. Supplement}},
  title        = {{Abstract P2-03-01: Analytical validation of a standardized scoring protocol for Ki67 assessed on breast excision whole sections: An international multicenter collaboration}},
  url          = {{http://dx.doi.org/10.1158/1538-7445.SABCS17-P2-03-01}},
  doi          = {{10.1158/1538-7445.SABCS17-P2-03-01}},
  volume       = {{78}},
  year         = {{2018}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Abstract P2-03-01: Analytical validation of a standardized scoring protocol for Ki67 assessed on breast excision whole sections: An international multicenter collaboration