Towards in silico CLIP-seq: predicting protein-RNA interaction via sequence-to-signal learning

Marc Horlacher*, Nils Wagner, Lambert Moyon, Klara Kuret, Nicolas Goedert, Marco Salvatore, Jernej Ule, Julien Gagneur, Ole Winther, Annalisa Marsico

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

8 Citations (Scopus)
13 Downloads (Pure)

Abstract

We present RBPNet, a novel deep learning method, which predicts CLIP-seq crosslink count distribution from RNA sequence at single-nucleotide resolution. By training on up to a million regions, RBPNet achieves high generalization on eCLIP, iCLIP and miCLIP assays, outperforming state-of-the-art classifiers. RBPNet performs bias correction by modeling the raw signal as a mixture of the protein-specific and background signal. Through model interrogation via Integrated Gradients, RBPNet identifies predictive sub-sequences that correspond to known and novel binding motifs and enables variant-impact scoring via in silico mutagenesis. Together, RBPNet improves imputation of protein-RNA interactions, as well as mechanistic interpretation of predictions.

Original languageEnglish
Article number180
JournalGenome Biology
Volume24
Issue number1
Number of pages37
ISSN1474-7596
DOIs
Publication statusPublished - 2023

Bibliographical note

Publisher Copyright:
© 2023, BioMed Central Ltd., part of Springer Nature.

Keywords

  • CLIP-seq
  • Computational biology
  • Deep learning
  • Protein-RNA interaction

Cite this