Abstract
We present RBPNet, a novel deep learning method, which predicts CLIP-seq crosslink count distribution from RNA sequence at single-nucleotide resolution. By training on up to a million regions, RBPNet achieves high generalization on eCLIP, iCLIP and miCLIP assays, outperforming state-of-the-art classifiers. RBPNet performs bias correction by modeling the raw signal as a mixture of the protein-specific and background signal. Through model interrogation via Integrated Gradients, RBPNet identifies predictive sub-sequences that correspond to known and novel binding motifs and enables variant-impact scoring via in silico mutagenesis. Together, RBPNet improves imputation of protein-RNA interactions, as well as mechanistic interpretation of predictions.
Original language | English |
---|---|
Article number | 180 |
Journal | Genome Biology |
Volume | 24 |
Issue number | 1 |
Number of pages | 37 |
ISSN | 1474-7596 |
DOIs | |
Publication status | Published - 2023 |
Bibliographical note
Publisher Copyright:© 2023, BioMed Central Ltd., part of Springer Nature.
Keywords
- CLIP-seq
- Computational biology
- Deep learning
- Protein-RNA interaction