PT - JOURNAL ARTICLE
AU - Adam J. Hockenberry
AU - Luís AN Amaral
AU - Michael C. Jewett
AU - Claus O. Wilke
TI - Selection removes Shine-Dalgarno-like sequences from within protein coding genes
AID - 10.1101/278689
DP - 2018 Jan 01
TA - bioRxiv
PG - 278689
4099 - http://biorxiv.org/content/early/2018/03/08/278689.short
4100 - http://biorxiv.org/content/early/2018/03/08/278689.full
AB - The Shine-Dalgarno (SD) sequence motif facilitates translation initiation and is frequently found upstream of bacterial start codons. However, thousands of instances of this motif occur throughout the middle of protein coding genes in a typical bacterial genome. Here, we use comparative evolutionary analysis to test whether SD sequences located within genes are functionally constrained. We measure the conservation of SD sequences across Gammaproteobacteria, and find that they are significantly less conserved than expected. Further, the strongest SD sequences are the least conserved whereas we find evidence of conservation for the weakest possible SD sequences given amino acid constraints. Our findings indicate that most SD sequences within genes are likely to be deleterious and removed via selection. To illustrate the origin of these deleterious costs, we show that ATG start codons are significantly depleted downstream of SD sequences within genes, highlighting the potential for these sequences to promote erroneous translation initiation.