Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

A practical approach to curate clonal hematopoiesis of indeterminate potential in human genetic data sets

Vlasschaert, Caitlyn ; Mack, Taralynn ; Heimlich, J. Brett ; Niroula, Abhishek LU ; Uddin, Md Mesbah ; Weinstock, Joshua ; Sharber, Brian ; Silver, Alexander J. ; Xu, Yaomin and Savona, Michael , et al. (2023) In Blood 141(18). p.2214-2223
Abstract

Clonal hematopoiesis of indeterminate potential (CHIP) is a common form of age-related somatic mosaicism that is associated with significant morbidity and mortality. CHIP mutations can be identified in peripheral blood samples that are sequenced using approaches that cover the whole genome, the whole exome, or targeted genetic regions; however, differentiating true CHIP mutations from sequencing artifacts and germ line variants is a considerable bioinformatic challenge. We present a stepwise method that combines filtering based on sequencing metrics, variant annotation, and population-based associations to increase the accuracy of CHIP calls. We apply this approach to ascertain CHIP in ∼550 000 individuals in the UK Biobank complete... (More)

Clonal hematopoiesis of indeterminate potential (CHIP) is a common form of age-related somatic mosaicism that is associated with significant morbidity and mortality. CHIP mutations can be identified in peripheral blood samples that are sequenced using approaches that cover the whole genome, the whole exome, or targeted genetic regions; however, differentiating true CHIP mutations from sequencing artifacts and germ line variants is a considerable bioinformatic challenge. We present a stepwise method that combines filtering based on sequencing metrics, variant annotation, and population-based associations to increase the accuracy of CHIP calls. We apply this approach to ascertain CHIP in ∼550 000 individuals in the UK Biobank complete whole exome cohort and the All of Us Research Program initial whole genome release cohort. CHIP ascertainment on this scale unmasks recurrent artifactual variants and highlights the importance of specialized filtering approaches for several genes, including TET2 and ASXL1. We show how small changes in filtering parameters can considerably increase CHIP misclassification and reduce the effect size of epidemiological associations. Our high-fidelity call set refines previous population-based associations of CHIP with incident outcomes. For example, the annualized incidence of myeloid malignancy in individuals with small CHIP clones is 0.03% per year, which increases to 0.5% per year among individuals with very large CHIP clones. We also find a significantly lower prevalence of CHIP in individuals of self-reported Latino or Hispanic ethnicity in All of Us, highlighting the importance of including diverse populations. The standardization of CHIP calling will increase the fidelity of CHIP epidemiological work and is required for clinical CHIP diagnostic assays.

(Less)
Please use this url to cite or link to this publication:
author
; ; ; ; ; ; ; ; and , et al. (More)
; ; ; ; ; ; ; ; ; ; ; ; ; ; ; and (Less)
organization
publishing date
type
Contribution to journal
publication status
published
subject
in
Blood
volume
141
issue
18
pages
10 pages
publisher
American Society of Hematology
external identifiers
  • pmid:36652671
  • scopus:85149754214
ISSN
0006-4971
DOI
10.1182/blood.2022018825
language
English
LU publication?
yes
id
11261bc0-6ee5-4ea7-8921-65c42c1f91c7
date added to LUP
2023-05-15 10:58:59
date last changed
2024-04-05 19:21:58
@article{11261bc0-6ee5-4ea7-8921-65c42c1f91c7,
  abstract     = {{<p>Clonal hematopoiesis of indeterminate potential (CHIP) is a common form of age-related somatic mosaicism that is associated with significant morbidity and mortality. CHIP mutations can be identified in peripheral blood samples that are sequenced using approaches that cover the whole genome, the whole exome, or targeted genetic regions; however, differentiating true CHIP mutations from sequencing artifacts and germ line variants is a considerable bioinformatic challenge. We present a stepwise method that combines filtering based on sequencing metrics, variant annotation, and population-based associations to increase the accuracy of CHIP calls. We apply this approach to ascertain CHIP in ∼550 000 individuals in the UK Biobank complete whole exome cohort and the All of Us Research Program initial whole genome release cohort. CHIP ascertainment on this scale unmasks recurrent artifactual variants and highlights the importance of specialized filtering approaches for several genes, including TET2 and ASXL1. We show how small changes in filtering parameters can considerably increase CHIP misclassification and reduce the effect size of epidemiological associations. Our high-fidelity call set refines previous population-based associations of CHIP with incident outcomes. For example, the annualized incidence of myeloid malignancy in individuals with small CHIP clones is 0.03% per year, which increases to 0.5% per year among individuals with very large CHIP clones. We also find a significantly lower prevalence of CHIP in individuals of self-reported Latino or Hispanic ethnicity in All of Us, highlighting the importance of including diverse populations. The standardization of CHIP calling will increase the fidelity of CHIP epidemiological work and is required for clinical CHIP diagnostic assays.</p>}},
  author       = {{Vlasschaert, Caitlyn and Mack, Taralynn and Heimlich, J. Brett and Niroula, Abhishek and Uddin, Md Mesbah and Weinstock, Joshua and Sharber, Brian and Silver, Alexander J. and Xu, Yaomin and Savona, Michael and Gibson, Christopher and Lanktree, Matthew B. and Rauh, Michael J. and Ebert, Benjamin L. and Natarajan, Pradeep and Jaiswal, Siddhartha and Bick, Alexander G.}},
  issn         = {{0006-4971}},
  language     = {{eng}},
  number       = {{18}},
  pages        = {{2214--2223}},
  publisher    = {{American Society of Hematology}},
  series       = {{Blood}},
  title        = {{A practical approach to curate clonal hematopoiesis of indeterminate potential in human genetic data sets}},
  url          = {{http://dx.doi.org/10.1182/blood.2022018825}},
  doi          = {{10.1182/blood.2022018825}},
  volume       = {{141}},
  year         = {{2023}},
}