TY - JOUR T1 - Predicting Transcriptional Regulatory Activities with Deep Convolutional Networks JF - bioRxiv DO - 10.1101/099879 SP - 099879 AU - Joe Paggi AU - Andrew Lamb AU - Kevin Tian AU - Irving Hsu AU - Pierre-Louis Cedoz AU - Prasad Kawthekar Y1 - 2017/01/01 UR - http://biorxiv.org/content/early/2017/01/12/099879.abstract N2 - Massively parallel reporter assays (MPRAs) are a method to probe the effects of short sequences on transcriptional regulation activity. In a MPRA, short sequences are extracted from suspected regulatory regions, inserted into reporter plasmids, transfected into cell-types of interest, and the transcriptional activity of each reporter is assayed. Recently, Ernst et al. presented MPRA data covering 15750 putative regulatory regions. We trained a multitask convolutional neural network architecture using these sequence expression readouts which predicts as output the expression level outputs across four combinations of cell types and promoters. The model allows for the assigning of importance scores to each base through in silico mutagenesis, and the resulting importance scores correlated well with regions enriched for conservation and transcription factor binding. ER -