Advanced

Computational Tools for Splicing Defect Prediction in Breast/Ovarian Cancer Genes : How Efficient Are They at Predicting RNA Alterations?

Moles-Fernández, Alejandro ; Duran-Lozano, Laura LU ; Montalban, Gemma ; Bonache, Sandra ; López-Perolio, Irene ; Menéndez, Mireia ; Santamariña, Marta ; Behar, Raquel ; Blanco, Ana and Carrasco, Estela , et al. (2018) In Frontiers in Genetics 9. p.1-12
Abstract

In silico tools for splicing defect prediction have a key role to assess the impact of variants of uncertain significance. Our aim was to evaluate the performance of a set of commonly used splicing in silico tools comparing the predictions against RNA in vitro results. This was done for natural splice sites of clinically relevant genes in hereditary breast/ovarian cancer (HBOC) and Lynch syndrome. A study divided into two stages was used to evaluate SSF-like, MaxEntScan, NNSplice, HSF, SPANR, and dbscSNV tools. A discovery dataset of 99 variants with unequivocal results of RNA in vitro studies, located in the 10 exonic and 20 intronic nucleotides adjacent to exon-intron boundaries of BRCA1, BRCA2, MLH1, MSH2, MSH6, PMS2, ATM, BRIP1,... (More)

In silico tools for splicing defect prediction have a key role to assess the impact of variants of uncertain significance. Our aim was to evaluate the performance of a set of commonly used splicing in silico tools comparing the predictions against RNA in vitro results. This was done for natural splice sites of clinically relevant genes in hereditary breast/ovarian cancer (HBOC) and Lynch syndrome. A study divided into two stages was used to evaluate SSF-like, MaxEntScan, NNSplice, HSF, SPANR, and dbscSNV tools. A discovery dataset of 99 variants with unequivocal results of RNA in vitro studies, located in the 10 exonic and 20 intronic nucleotides adjacent to exon-intron boundaries of BRCA1, BRCA2, MLH1, MSH2, MSH6, PMS2, ATM, BRIP1, CDH1, PALB2, PTEN, RAD51D, STK11, and TP53, was collected from four Spanish cancer genetic laboratories. The best stand-alone predictors or combinations were validated with a set of 346 variants in the same genes with clear splicing outcomes reported in the literature. Sensitivity, specificity, accuracy, negative predictive value (NPV) and Mathews Coefficient Correlation (MCC) scores were used to measure the performance. The discovery stage showed that HSF and SSF-like were the most accurate for variants at the donor and acceptor region, respectively. The further combination analysis revealed that HSF, HSF+SSF-like or HSF+SSF-like+MES achieved a high performance for predicting the disruption of donor sites, and SSF-like or a sequential combination of MES and SSF-like for predicting disruption of acceptor sites. The performance confirmation of these last results with the validation dataset, indicated that the highest sensitivity, accuracy, and NPV (99.44%, 99.44%, and 96.88, respectively) were attained with HSF+SSF-like or HSF+SSF-like+MES for donor sites and SSF-like (92.63%, 92.65%, and 84.44, respectively) for acceptor sites. We provide recommendations for combining algorithms to conduct in silico splicing analysis that achieved a high performance. The high NPV obtained allows to select the variants in which the study by in vitro RNA analysis is mandatory against those with a negligible probability of being spliceogenic. Our study also shows that the performance of each specific predictor varies depending on whether the natural splicing sites are donors or acceptors.

(Less)
Please use this url to cite or link to this publication:
author
, et al. (More)
(Less)
publishing date
type
Contribution to journal
publication status
published
subject
in
Frontiers in Genetics
volume
9
article number
366
pages
1 - 12
publisher
Frontiers
external identifiers
  • scopus:85053134220
  • pmid:30233647
ISSN
1664-8021
DOI
10.3389/fgene.2018.00366
language
English
LU publication?
no
id
fd23da23-ad60-4233-89d7-23662fe869c6
date added to LUP
2019-05-09 10:03:47
date last changed
2019-12-10 08:03:31
@article{fd23da23-ad60-4233-89d7-23662fe869c6,
  abstract     = {<p>In silico tools for splicing defect prediction have a key role to assess the impact of variants of uncertain significance. Our aim was to evaluate the performance of a set of commonly used splicing in silico tools comparing the predictions against RNA in vitro results. This was done for natural splice sites of clinically relevant genes in hereditary breast/ovarian cancer (HBOC) and Lynch syndrome. A study divided into two stages was used to evaluate SSF-like, MaxEntScan, NNSplice, HSF, SPANR, and dbscSNV tools. A discovery dataset of 99 variants with unequivocal results of RNA in vitro studies, located in the 10 exonic and 20 intronic nucleotides adjacent to exon-intron boundaries of BRCA1, BRCA2, MLH1, MSH2, MSH6, PMS2, ATM, BRIP1, CDH1, PALB2, PTEN, RAD51D, STK11, and TP53, was collected from four Spanish cancer genetic laboratories. The best stand-alone predictors or combinations were validated with a set of 346 variants in the same genes with clear splicing outcomes reported in the literature. Sensitivity, specificity, accuracy, negative predictive value (NPV) and Mathews Coefficient Correlation (MCC) scores were used to measure the performance. The discovery stage showed that HSF and SSF-like were the most accurate for variants at the donor and acceptor region, respectively. The further combination analysis revealed that HSF, HSF+SSF-like or HSF+SSF-like+MES achieved a high performance for predicting the disruption of donor sites, and SSF-like or a sequential combination of MES and SSF-like for predicting disruption of acceptor sites. The performance confirmation of these last results with the validation dataset, indicated that the highest sensitivity, accuracy, and NPV (99.44%, 99.44%, and 96.88, respectively) were attained with HSF+SSF-like or HSF+SSF-like+MES for donor sites and SSF-like (92.63%, 92.65%, and 84.44, respectively) for acceptor sites. We provide recommendations for combining algorithms to conduct in silico splicing analysis that achieved a high performance. The high NPV obtained allows to select the variants in which the study by in vitro RNA analysis is mandatory against those with a negligible probability of being spliceogenic. Our study also shows that the performance of each specific predictor varies depending on whether the natural splicing sites are donors or acceptors.</p>},
  author       = {Moles-Fernández, Alejandro and Duran-Lozano, Laura and Montalban, Gemma and Bonache, Sandra and López-Perolio, Irene and Menéndez, Mireia and Santamariña, Marta and Behar, Raquel and Blanco, Ana and Carrasco, Estela and López-Fernández, Adrià and Stjepanovic, Neda and Balmaña, Judith and Capellá, Gabriel and Pineda, Marta and Vega, Ana and Lázaro, Conxi and de la Hoya, Miguel and Diez, Orland and Gutiérrez-Enríquez, Sara},
  issn         = {1664-8021},
  language     = {eng},
  pages        = {1--12},
  publisher    = {Frontiers},
  series       = {Frontiers in Genetics},
  title        = {Computational Tools for Splicing Defect Prediction in Breast/Ovarian Cancer Genes : How Efficient Are They at Predicting RNA Alterations?},
  url          = {http://dx.doi.org/10.3389/fgene.2018.00366},
  doi          = {10.3389/fgene.2018.00366},
  volume       = {9},
  year         = {2018},
}