Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

AIRR-C IG Reference Sets : curated sets of immunoglobulin heavy and light chain germline genes

Collins, Andrew M. ; Ohlin, Mats LU orcid ; Corcoran, Martin ; Heather, James M. ; Ralph, Duncan ; Law, Mansun ; Martínez-Barnetche, Jesus ; Ye, Jian ; Richardson, Eve and Gibson, William S. , et al. (2023) In Frontiers in Immunology 14.
Abstract

Introduction: Analysis of an individual’s immunoglobulin (IG) gene repertoire requires the use of high-quality germline gene reference sets. When sets only contain alleles supported by strong evidence, AIRR sequencing (AIRR-seq) data analysis is more accurate and studies of the evolution of IG genes, their allelic variants and the expressed immune repertoire is therefore facilitated. Methods: The Adaptive Immune Receptor Repertoire Community (AIRR-C) IG Reference Sets have been developed by including only human IG heavy and light chain alleles that have been confirmed by evidence from multiple high-quality sources. To further improve AIRR-seq analysis, some alleles have been extended to deal with short 3’ or 5’ truncations that can lead... (More)

Introduction: Analysis of an individual’s immunoglobulin (IG) gene repertoire requires the use of high-quality germline gene reference sets. When sets only contain alleles supported by strong evidence, AIRR sequencing (AIRR-seq) data analysis is more accurate and studies of the evolution of IG genes, their allelic variants and the expressed immune repertoire is therefore facilitated. Methods: The Adaptive Immune Receptor Repertoire Community (AIRR-C) IG Reference Sets have been developed by including only human IG heavy and light chain alleles that have been confirmed by evidence from multiple high-quality sources. To further improve AIRR-seq analysis, some alleles have been extended to deal with short 3’ or 5’ truncations that can lead them to be overlooked by alignment utilities. To avoid other challenges for analysis programs, exact paralogs (e.g. IGHV1-69*01 and IGHV1-69D*01) are only represented once in each set, though alternative sequence names are noted in accompanying metadata. Results and discussion: The Reference Sets include less than half the previously recognised IG alleles (e.g. just 198 IGHV sequences), and also include a number of novel alleles: 8 IGHV alleles, 2 IGKV alleles and 5 IGLV alleles. Despite their smaller sizes, erroneous calls were eliminated, and excellent coverage was achieved when a set of repertoires comprising over 4 million V(D)J rearrangements from 99 individuals were analyzed using the Sets. The version-tracked AIRR-C IG Reference Sets are freely available at the OGRDB website (https://ogrdb.airr-community.org/germline_sets/Human) and will be regularly updated to include newly observed and previously reported sequences that can be confirmed by new high-quality data.

(Less)
Please use this url to cite or link to this publication:
author
; ; ; ; ; ; ; ; and , et al. (More)
; ; ; ; ; ; ; ; ; ; ; ; ; and (Less)
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
heavy chain, IGHD, IGHJ, IGHV genes, immunoglobulin, light chain
in
Frontiers in Immunology
volume
14
article number
1330153
publisher
Frontiers Media S. A.
external identifiers
  • pmid:38406579
  • scopus:85185503855
ISSN
1664-3224
DOI
10.3389/fimmu.2023.1330153
language
English
LU publication?
yes
id
0fc86b4d-7bd4-4dec-8a55-0aa277fa584b
date added to LUP
2024-03-18 16:05:54
date last changed
2024-04-29 15:29:43
@article{0fc86b4d-7bd4-4dec-8a55-0aa277fa584b,
  abstract     = {{<p>Introduction: Analysis of an individual’s immunoglobulin (IG) gene repertoire requires the use of high-quality germline gene reference sets. When sets only contain alleles supported by strong evidence, AIRR sequencing (AIRR-seq) data analysis is more accurate and studies of the evolution of IG genes, their allelic variants and the expressed immune repertoire is therefore facilitated. Methods: The Adaptive Immune Receptor Repertoire Community (AIRR-C) IG Reference Sets have been developed by including only human IG heavy and light chain alleles that have been confirmed by evidence from multiple high-quality sources. To further improve AIRR-seq analysis, some alleles have been extended to deal with short 3’ or 5’ truncations that can lead them to be overlooked by alignment utilities. To avoid other challenges for analysis programs, exact paralogs (e.g. IGHV1-69*01 and IGHV1-69D*01) are only represented once in each set, though alternative sequence names are noted in accompanying metadata. Results and discussion: The Reference Sets include less than half the previously recognised IG alleles (e.g. just 198 IGHV sequences), and also include a number of novel alleles: 8 IGHV alleles, 2 IGKV alleles and 5 IGLV alleles. Despite their smaller sizes, erroneous calls were eliminated, and excellent coverage was achieved when a set of repertoires comprising over 4 million V(D)J rearrangements from 99 individuals were analyzed using the Sets. The version-tracked AIRR-C IG Reference Sets are freely available at the OGRDB website (https://ogrdb.airr-community.org/germline_sets/Human) and will be regularly updated to include newly observed and previously reported sequences that can be confirmed by new high-quality data.</p>}},
  author       = {{Collins, Andrew M. and Ohlin, Mats and Corcoran, Martin and Heather, James M. and Ralph, Duncan and Law, Mansun and Martínez-Barnetche, Jesus and Ye, Jian and Richardson, Eve and Gibson, William S. and Rodriguez, Oscar L. and Peres, Ayelet and Yaari, Gur and Watson, Corey T. and Lees, William D.}},
  issn         = {{1664-3224}},
  keywords     = {{heavy chain; IGHD; IGHJ; IGHV genes; immunoglobulin; light chain}},
  language     = {{eng}},
  publisher    = {{Frontiers Media S. A.}},
  series       = {{Frontiers in Immunology}},
  title        = {{AIRR-C IG Reference Sets : curated sets of immunoglobulin heavy and light chain germline genes}},
  url          = {{http://dx.doi.org/10.3389/fimmu.2023.1330153}},
  doi          = {{10.3389/fimmu.2023.1330153}},
  volume       = {{14}},
  year         = {{2023}},
}