RT Journal Article SR Electronic T1 Inferring Sequence-Structure Preferences of RNA-Binding Proteins with Convolutional Residual Networks JF bioRxiv FD Cold Spring Harbor Laboratory SP 418459 DO 10.1101/418459 A1 Koo, Peter K. A1 Anand, Praveen A1 Paul, Steffan B. A1 Eddy, Sean R. YR 2018 UL http://biorxiv.org/content/early/2018/09/15/418459.abstract AB To infer the sequence and RNA structure specificities of RNA-binding proteins (RBPs) from experiments that enrich for bound sequences, we introduce a convolutional residual network which we call ResidualBind. ResidualBind significantly outperforms previous methods on experimental data from many RBP families. We interrogate ResidualBind to identify what features it has learned from high-affinity sequences with saliency analysis along with 1st-order and 2nd-order in silico mutagenesis. We show that in addition to sequence motifs, ResidualBind learns a model that includes the number of motifs, their spacing, and both positive and negative effects of RNA structure context. Strikingly, ResidualBind learns RNA structure context, including detailed base-pairing relationships, directly from sequence data, which we confirm on synthetic data. ResidualBind is a powerful, flexible, and interpretable model that can uncover cis-recognition preferences across a broad spectrum of RBPs.