PrecisionFDA Truth Challenge V2: Calling variants from short and long reads in difficult-to-map regions

Olson, Nathan D.; Wagner, Justin; McDaniel, Jennifer; Stephens, Sarah H.; Westreich, Samuel T.; Prasanna, Anish G.; Johanson, Elaine; Boja, Emily; Maier, Ezekiel J.; Serang, Omar; Jáspez, David; Lorenzo-Salazar, José M.; Muñoz-Barrera, Adrián; Rubio-Rodríguez, Luis A.; Flores, Carlos; Kyriakidis, Konstantinos; Malousi, Andigoni; Shafin, Kishwar; Pesout, Trevor; Jain, Miten; Paten, Benedict; Chang, Pi-Chuan; Kolesnikov, Alexey; Nattestad, Maria; Baid, Gunjan; Goel, Sidharth; Yang, Howard; Carroll, Andrew; Eveleigh, Robert; Bourgey, Mathieu; Bourque, Guillaume; Li, Gen; MA, ChouXian; Tang, LinQi; DU, YuanPing; Zhang, ShaoWei; Morata, Jordi; Tonda, Raúl; Parra, Genís; Trotta, Jean-Rémi; Brueffer, Christian; Demirkaya-Budak, Sinem; Kabakci-Zorlu, Duygu; Turgut, Deniz; Kalay, Özem; Budak, Gungor; Narcı, Kübra; Arslan, Elif; Brown, Richard; Johnson, Ivan J.

Olson, Nathan D.; Wagner, Justin; McDaniel, Jennifer; Stephens, Sarah H.; Westreich, Samuel T.; Prasanna, Anish G.; Johanson, Elaine; Boja, Emily; Maier, Ezekiel J.; Serang, Omar, et al. (2022) In Cell Genomics 2(5). p. 1-12

Abstract: The precisionFDA Truth Challenge V2 aimed to assess the state of the art of variant calling in challenging genomic regions. Starting with FASTQs, 20 challenge participants applied their variant-calling pipelines and submitted 64 variant call sets for one or more sequencing technologies (Illumina, PacBio HiFi, and Oxford Nanopore Technologies). Submissions were evaluated following best practices for benchmarking small variants with updated Genome in a Bottle benchmark sets and genome stratifications. Challenge submissions included numerous innovative methods, with graph-based and machine learning methods scoring best for short-read and long-read datasets, respectively. With machine learning approaches, combining multiple sequencing technologies performed particularly well. Recent developments in sequencing and variant calling have enabled benchmarking variants in challenging genomic regions, paving the way for the identification of previously unknown clinically relevant variants.

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/321a29e2-0c7d-4659-90cc-e5a073131432

author

Olson, Nathan D.; Wagner, Justin; McDaniel, Jennifer; Stephens, Sarah H.; Westreich, Samuel T.; Prasanna, Anish G.; Johanson, Elaine; Boja, Emily; Maier, Ezekiel J.; Serang, Omar; Jáspez, David; Lorenzo-Salazar, José M.; Muñoz-Barrera, Adrián; Rubio-Rodríguez, Luis A.; Flores, Carlos; Kyriakidis, Konstantinos; Malousi, Andigoni; Shafin, Kishwar; Pesout, Trevor; Jain, Miten; Paten, Benedict; Chang, Pi-Chuan; Kolesnikov, Alexey; Nattestad, Maria; Baid, Gunjan; Goel, Sidharth; Yang, Howard; Carroll, Andrew; Eveleigh, Robert; Bourgey, Mathieu; Bourque, Guillaume; Li, Gen; MA, ChouXian; Tang, LinQi; DU, YuanPing; Zhang, ShaoWei; Morata, Jordi; Tonda, Raúl; Parra, Genís; Trotta, Jean-Rémi; Brueffer, Christian (LU); Demirkaya-Budak, Sinem; Kabakci-Zorlu, Duygu; Turgut, Deniz; Kalay, Özem; Budak, Gungor; Narcı, Kübra; Arslan, Elif; Brown, Richard; Johnson, Ivan J.; Dolgoborodov, Alexey; Semenyuk, Vladimir; Jain, Amit; Tetikol, H. Serhat; Jain, Varun; Ruehle, Mike; Lajoie, Bryan; Roddey, Cooper; Catreux, Severine; Mehio, Rami; Ahsan, Mian Umair; Liu, Qian; Wang, Kai; Sahraeian, Sayed Mohammad Ebrahim; Fang, Li Tai; Mohiyuddin, Marghoob; Hung, Calvin; Jain, Chirag; Feng, Hanying; Li, Zhipan; Chen, Luoqi; Sedlazeck, Fritz J. and Zook, Justin M

organization

publishing date

2022-05-11

type

Contribution to journal

publication status

published

subject

keywords

DNA, variant, short-read sequencing, long-read sequencing, benchmark

in

Cell Genomics

volume

2

issue

5

article number

100129

pages

1 - 12

publisher

Cell Press

external identifiers

pmid:35720974
scopus:85134614281

ISSN

2666-979X

DOI

10.1016/j.xgen.2022.100129

language

English

LU publication?

yes

id

321a29e2-0c7d-4659-90cc-e5a073131432

date added to LUP

2022-05-14 14:02:21

date last changed

2025-10-14 11:42:24

@article{321a29e2-0c7d-4659-90cc-e5a073131432,
  abstract     = {{The precisionFDA Truth Challenge V2 aimed to assess the state of the art of variant calling in challenging genomic regions. Starting with FASTQs, 20 challenge participants applied their variant-calling pipelines and submitted 64 variant call sets for one or more sequencing technologies (Illumina, PacBio HiFi, and Oxford Nanopore Technologies). Submissions were evaluated following best practices for benchmarking small variants with updated Genome in a Bottle benchmark sets and genome stratifications. Challenge submissions included numerous innovative methods, with graph-based and machine learning methods scoring best for short-read and long-read datasets, respectively. With machine learning approaches, combining multiple sequencing technologies performed particularly well. Recent developments in sequencing and variant calling have enabled benchmarking variants in challenging genomic regions, paving the way for the identification of previously unknown clinically relevant variants.}},
  author       = {{Olson, Nathan D. and Wagner, Justin and McDaniel, Jennifer and Stephens, Sarah H. and Westreich, Samuel T. and Prasanna, Anish G. and Johanson, Elaine and Boja, Emily and Maier, Ezekiel J. and Serang, Omar and Jáspez, David and Lorenzo-Salazar, José M. and Muñoz-Barrera, Adrián and Rubio-Rodríguez, Luis A. and Flores, Carlos and Kyriakidis, Konstantinos and Malousi, Andigoni and Shafin, Kishwar and Pesout, Trevor and Jain, Miten and Paten, Benedict and Chang, Pi-Chuan and Kolesnikov, Alexey and Nattestad, Maria and Baid, Gunjan and Goel, Sidharth and Yang, Howard and Carroll, Andrew and Eveleigh, Robert and Bourgey, Mathieu and Bourque, Guillaume and Li, Gen and MA, ChouXian and Tang, LinQi and DU, YuanPing and Zhang, ShaoWei and Morata, Jordi and Tonda, Raúl and Parra, Genís and Trotta, Jean-Rémi and Brueffer, Christian and Demirkaya-Budak, Sinem and Kabakci-Zorlu, Duygu and Turgut, Deniz and Kalay, Özem and Budak, Gungor and Narcı, Kübra and Arslan, Elif and Brown, Richard and Johnson, Ivan J. and Dolgoborodov, Alexey and Semenyuk, Vladimir and Jain, Amit and Tetikol, H. Serhat and Jain, Varun and Ruehle, Mike and Lajoie, Bryan and Roddey, Cooper and Catreux, Severine and Mehio, Rami and Ahsan, Mian Umair and Liu, Qian and Wang, Kai and Sahraeian, Sayed Mohammad Ebrahim and Fang, Li Tai and Mohiyuddin, Marghoob and Hung, Calvin and Jain, Chirag and Feng, Hanying and Li, Zhipan and Chen, Luoqi and Sedlazeck, Fritz J. and Zook, Justin M}},
  issn         = {{2666-979X}},
  keywords     = {{DNA; variant; short-read sequencing; long-read sequencing; benchmark}},
  language     = {{eng}},
  month        = {{05}},
  number       = {{5}},
  pages        = {{1--12}},
  publisher    = {{Cell Press}},
  journal      = {{Cell Genomics}},
  title        = {{PrecisionFDA Truth Challenge V2: Calling variants from short and long reads in difficult-to-map regions}},
  url          = {{http://dx.doi.org/10.1016/j.xgen.2022.100129}},
  doi          = {{10.1016/j.xgen.2022.100129}},
  volume       = {{2}},
  year         = {{2022}},
}
