Fusions4U: a resource of validated and annotated gene fusions in 328 cancer cell lines
(2025) In BMC Cancer- Abstract
- Background
Large-scale genomic rearrangements in cancer cells can lead to the creation of fusion genes which alter the expression or function of the partner genes. Some fusion genes act as tumour drivers and the use of kinase inhibitors in patients with specific fusions have led to breakthroughs in cancer therapy. The large clinical and scientific interest in fusions has led to the development of software that use RNA sequencing data to identify fusion transcripts. Unfortunately, fusion transcript callers output many predictions which lack underlying genomic rearrangements and large datasets with validated fusions are scarce.
Results
This paper and the accompanying Fusions4U web application present a resource of validated... (More) - Background
Large-scale genomic rearrangements in cancer cells can lead to the creation of fusion genes which alter the expression or function of the partner genes. Some fusion genes act as tumour drivers and the use of kinase inhibitors in patients with specific fusions have led to breakthroughs in cancer therapy. The large clinical and scientific interest in fusions has led to the development of software that use RNA sequencing data to identify fusion transcripts. Unfortunately, fusion transcript callers output many predictions which lack underlying genomic rearrangements and large datasets with validated fusions are scarce.
Results
This paper and the accompanying Fusions4U web application present a resource of validated and annotated gene fusions for 328 cell lines from the Cancer Cell Line Encyclopedia. Predicted fusion transcripts from Arriba and STAR-Fusion were analysed with our published validation pipeline that uses matched whole-genome sequencing data to identify discordantly mapped read pairs and candidate genomic breakpoints that support genuine fusion events. This resulted in 8,753 and 2,244 validated fusion transcripts for Arriba and STAR-Fusion predictions, respectively, with 1,596 fusions common to both. Additional layers of annotation include alternative splicing of fusion transcripts, kinases, microRNA host genes, genes in the COSMIC Cancer Gene Census, as well as known fusion gene pairs from the Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer and the TumorFusions dataset. We furthermore analysed information about fusion genes together with cell line data from the PRISM drug repurposing screening as an example of how this dataset can be used.
Conclusions
This resource can be used to design experiments for functional studies and drug development, alone or in combination with publicly available information for the Cancer Cell Line Encyclopedia cell lines. The large collection of validated fusion transcripts with candidate genomic breakpoints can also be used in development and evaluation of bioinformatic tools for fusion transcript detection. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/5ec2d2db-4a43-40a8-ad76-ac511734dd1a
- author
- Alamshahi, Arianna
LU
and Persson, Helena
LU
- organization
- publishing date
- 2025-12-20
- type
- Contribution to journal
- publication status
- epub
- subject
- in
- BMC Cancer
- publisher
- BioMed Central (BMC)
- external identifiers
-
- pmid:41421967
- ISSN
- 1471-2407
- DOI
- 10.1186/s12885-025-15441-w
- language
- English
- LU publication?
- yes
- id
- 5ec2d2db-4a43-40a8-ad76-ac511734dd1a
- date added to LUP
- 2025-12-22 21:21:01
- date last changed
- 2025-12-23 08:56:29
@article{5ec2d2db-4a43-40a8-ad76-ac511734dd1a,
abstract = {{Background<br/>Large-scale genomic rearrangements in cancer cells can lead to the creation of fusion genes which alter the expression or function of the partner genes. Some fusion genes act as tumour drivers and the use of kinase inhibitors in patients with specific fusions have led to breakthroughs in cancer therapy. The large clinical and scientific interest in fusions has led to the development of software that use RNA sequencing data to identify fusion transcripts. Unfortunately, fusion transcript callers output many predictions which lack underlying genomic rearrangements and large datasets with validated fusions are scarce.<br/><br/>Results<br/>This paper and the accompanying Fusions4U web application present a resource of validated and annotated gene fusions for 328 cell lines from the Cancer Cell Line Encyclopedia. Predicted fusion transcripts from Arriba and STAR-Fusion were analysed with our published validation pipeline that uses matched whole-genome sequencing data to identify discordantly mapped read pairs and candidate genomic breakpoints that support genuine fusion events. This resulted in 8,753 and 2,244 validated fusion transcripts for Arriba and STAR-Fusion predictions, respectively, with 1,596 fusions common to both. Additional layers of annotation include alternative splicing of fusion transcripts, kinases, microRNA host genes, genes in the COSMIC Cancer Gene Census, as well as known fusion gene pairs from the Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer and the TumorFusions dataset. We furthermore analysed information about fusion genes together with cell line data from the PRISM drug repurposing screening as an example of how this dataset can be used.<br/><br/>Conclusions<br/>This resource can be used to design experiments for functional studies and drug development, alone or in combination with publicly available information for the Cancer Cell Line Encyclopedia cell lines. The large collection of validated fusion transcripts with candidate genomic breakpoints can also be used in development and evaluation of bioinformatic tools for fusion transcript detection.}},
author = {{Alamshahi, Arianna and Persson, Helena}},
issn = {{1471-2407}},
language = {{eng}},
month = {{12}},
publisher = {{BioMed Central (BMC)}},
series = {{BMC Cancer}},
title = {{Fusions4U: a resource of validated and annotated gene fusions in 328 cancer cell lines}},
url = {{http://dx.doi.org/10.1186/s12885-025-15441-w}},
doi = {{10.1186/s12885-025-15441-w}},
year = {{2025}},
}