Abstract
Gene fusions are common cancer drivers and therapeutic targets, but clinical-grade open-source bioinformatic tools are lacking. Here, we introduce a fusion detection method named SplitFusion, which is fast by leveraging Burrows-Wheeler Aligner-maximal exact match (BWA-MEM) split alignments, can detect cryptic splice-site fusions (e.g., EML4::ALK v3b and ARv7), call fusions involving highly repetitive gene partners (e.g., CIC::DUX4), and infer frame-ness and exon-boundary alignments for functional prediction and minimizing false positives. Using 1,848 datasets of various sizes, SplitFusion demonstrated superior sensitivity and specificity compared to three other tools. In 1,076 formalin-fixed paraffin-embedded lung cancer samples, SplitFusion identified novel fusions and revealed that EML4::ALK variant 3 was associated with multiple fusion variants coexisting in the same tumor. Additionally, SplitFusion can call targeted splicing variants. Using data from 515 The Cancer Genome Atlas (TCGA) samples, SplitFusion showed the highest sensitivity and uncovered two cases of SLC34A2::ROS1 that were missed in previous studies. These capabilities make SplitFusion highly suitable for clinical applications and the study of fusion-defined tumor heterogeneity.
Original language | English |
---|---|
Article number | 101174 |
Journal | Patterns |
Volume | 6 |
Issue number | 2 |
DOIs | |
State | Published - 14 Feb 2025 |
Keywords
- ALK variant
- AMP
- ARv7
- DUX4
- FFPE samples
- RNA-seq
- ROS1
- TCGA
- fusion detection tool
- tumor heterogeneity
All Science Journal Classification (ASJC) codes
- General Decision Sciences