PT - JOURNAL ARTICLE AU - Koo, Peter K. AU - Anand, Praveen AU - Paul, Steffan B. AU - Eddy, Sean R. TI - Inferring Sequence-Structure Preferences of RNA-Binding Proteins with Convolutional Residual Networks AID - 10.1101/418459 DP - 2018 Jan 01 TA - bioRxiv PG - 418459 4099 - http://biorxiv.org/content/early/2018/09/15/418459.short 4100 - http://biorxiv.org/content/early/2018/09/15/418459.full AB - To infer the sequence and RNA structure specificities of RNA-binding proteins (RBPs) from experiments that enrich for bound sequences, we introduce a convolutional residual network which we call ResidualBind. ResidualBind significantly outperforms previous methods on experimental data from many RBP families. We interrogate ResidualBind to identify what features it has learned from high-affinity sequences with saliency analysis along with 1st-order and 2nd-order in silico mutagenesis. We show that in addition to sequence motifs, ResidualBind learns a model that includes the number of motifs, their spacing, and both positive and negative effects of RNA structure context. Strikingly, ResidualBind learns RNA structure context, including detailed base-pairing relationships, directly from sequence data, which we confirm on synthetic data. ResidualBind is a powerful, flexible, and interpretable model that can uncover cis-recognition preferences across a broad spectrum of RBPs.