Pervasive selection against microRNA target sites in human populations

Andrea Hatlen; Antonio Marco

doi:10.1101/420646

ABSTRACT

MicroRNA target sites are often conserved during evolution and purifying selection to maintain such sites is expected. On the other hand, comparative analyses identified a paucity of microRNA target sites in co-expressed transcripts, and novel target sites can potentially be deleterious. We proposed that selection against novel target sites pervasive. The analysis of derived allele frequencies revealed that, when the derived allele is a target site, the proportion of non-target sites is higher than expected, particularly for highly expressed microRNAs. Thus, new alleles generating novel microRNA target sites can be deleterious and selected against. When we analysed ancestral target sites the derived (non-target) allele frequency does not show statistical support for microRNA target allele conservation. We investigated the joint effects of microRNA conservation and expression and found that selection against microRNA target sites depends mostly on the expression level of the microRNA. We identified microRNA target sites with relatively high levels of population differentiation. However, when we analyse separately target sites in which the target allele is ancestral to the population, the proportion of SNPs with high Fst significantly increases. These findings support population differentiation is more likely in target sites that are lost than in the gain of new target sites. Our results indicate that selection against novel microRNA target sites is prevalent and, although individual sites may have a weak selective pressure, the overall effect across untranslated regions is not negligible and should be accounted when studying the evolution of genomic sequences.

INTRODUCTION

MicroRNAs are small endogenous RNAs that can regulate virtually any type of biological process. Following their discovery in humans this century (Pasquinelli et al. 2000), there are now over 2500 human microRNA precursors annotated in miRBase (Kozomara and Griffiths-Jones 2014), although less than 900 are classified with high confidence. Soon after microRNAs were found in multiple animal species (Lagos-Quintana et al. 2001; Lau et al. 2001; Lee and Ambros 2001), the first target prediction tools became available (Enright et al. 2003; Lewis et al. 2003; Stark et al. 2003). Only in the last few years have these developments permitted the evolutionary analysis of target sites (Farh et al. 2005; Grün et al. 2005; Lewis et al. 2005; Stark et al. 2005; Sood et al. 2006) revealing that many microRNA target sites are highly conserved among species. In contrast, whilst some microRNA families have been conserved for millions of years, their targets appear to differ between species (see for instance (Hui et al. 2013). Indeed, evidence from vertebrates suggests that gains and losses of target sites may be more important than changes in the microRNAs themselves during the evolution of microRNA-based gene regulation, as microRNA genes are usually highly conserved (see Discussion in (Marco 2018a)).

Several studies have found that gene transcripts are depleted of target sites for co-expressed microRNAs (Farh et al. 2005; Stark et al. 2005; Sood et al. 2006). In particular long 3’ UTRs might accumulate microRNA target sites by random mutation, yet they actually have a lower frequency than expected by chance, suggesting that there has been selection against these sequences (Farh et al. 2005). These missing sites have been called ‘anti-targets’ (Bartel and Chen 2004; Farh et al. 2005). Interestingly, target sites for the same microRNAs tend to be conserved in transcripts expressed in neighboring tissues (Stark et al. 2005). These studies have shown that selection against microRNA target sites can be inferred from comparisons among distantly related species. However, the relative impact of selection against microRNA target sites in human populations is not known.

Analysis of human populations has suggested purifying selection was particularly strong at microRNA target sites, even in non-conserved sites (Chen and Rajewsky 2006; Saunders et al. 2007). Negative selection against gaining microRNA target sites has also been described in Yoruban populations, but the pattern was not detected in other populations (Chen and Rajewsky 2006). Correspondingly, we have previously found evidence of selection against microRNA target sites in a study in Drosophila populations (Marco 2015). Specifically, we found selection against target sites of maternal microRNAs in maternally deposited transcripts within the egg and early ambryo. More recently it has been shown that this effect is particularly strong for the mir-309 cluster, whose microRNAs are abundant in the egg and almost absent in the zygote (Zhou et al. 2018). Characterising this type of selection in humans would reveal to which extent it shapes our genomes. However, the strength and prevalence of selection against target sites is human populations is still unknown. Here we investigate single nucleotide polymorphisms at human microRNA target sites and evaluated the impact of selection against target sites. To do so, we consider pairs of segregating alleles in which one of the allele as a target site and the other was not. Then we compare the allele frequency distributions with that of estimated background distributions to quantify the strength of selection for or against microRNA target sites.

RESULTS

Bias towards microRNA non-target alleles in human populations

In order to investigate the selective pressures on microRNA target sites in human populations, we first mapped human single-nucleotide polymorphisms (SNPs) to putative canonical microRNA target sites such that one allele is a target site and the alternative allele is not a target site (see Methods for details). The non-target allele in this pair is called a ‘near-target’ (Marco 2018b). We first compared the derived allele frequency (DAF) distribution of target sites for highly expressed microRNAs with a background (expected) distribution obtained by conducting the same analysis on the reverse complement sequences of 3’UTRs (see Methods). We considered those SNPs for which the derived allele is the target allele. If the overall selective pressure is to fix new target sites (positive selection favouring new interactions), DAF values should be the higher than expected. On the other hand, if there is selection against microRNA target sites, the DAF distribution should be skewed towards smaller values (Figure 1). We observe an overall excess of low frequency derived alleles (Figure 2A; p=0.009, one-tailed Kolmogorov-Smirnov test).

Figure 1. Analysis of derived allele frequencies at target sites.

Pairs of alleles where one allele is a target site and the other is not are identified, and only those where the non-target allele was the ancestral state were further considered. The derived (not ancestral) allele frequency distribution will be skewed towards the target allele (right) if selection favours the emergence of new microRNA target sites, or to the non-target allele (left) if selection acts against new targets.

Figure 2. Derived allele frequencies (DAF) at microRNA target sites.

(A) Derive allele frequency distribution of microRNA target sites, where the derived allele is a target site, compared to a background distribution for the whole dataset. (B) As in A but for interactions between coexpressed microRNA/transcript pairs in lung tissue. (C) Significance levels for differences in the DAF distribution for ten tissues expressed as −log10[q-value] (see Methods). The vertical line indicates the threshold for one expected false positive.

It is expected that a selective pressure on co-expressed microRNA/mRNA pairs will be tissuespecific. Therefore, we compared the DAF distributions as above for ten specific tissues, considering only interactions in which the microRNA and the potential target are co-expressed. For instance, for lung co-expressed microRNA/transcripts we found a DAF distribution significantly skewed towards the non-target allele, indicating selective pressures against the target allele (onetailed Kolmogorov-Smirnov test, p=0.00012; Figure 2B). For all tissues analysed we observe a shift in the derived allele frequencies towards the non-target allele (Supplementary Figure 1), all but breast within a 10% False Discovery Rate (less than one expected false positive; Figure 2C, Supplementary Table 1). The poor statistical signal in breast tissue can be easily explained by a very small sample size (42 SNPs in total).

Genes can have alternative polyadenylation in different tissues, affecting microRNA target sites. However, in the context of microRNAs, this has been explored mostly in cell lines (Nam et al. 2014). We found alternative polyadenilation information for two of the tissues here investigated: blood and kidney (Müller et al. 2014). When we consider target sites whose 3’UTR has been experimentally detected in those tissues, the shift to the non-target allele is significant (Supplementary Table 1; Supplementary Figure 1).

Alternatively, we can compare the allele frequency distributions between target sites for highly expressed microRNAs versus target sites for non-expressed (non-detected) microRNAs in a given tissue. To do so we identified microRNAs with no read counts detected across multiple expression experiments (see Methods) and used their target sites as our background (expected) frequencies. Consistently with the results above, we found that for most analysed tissues the DAF distribution when the ancestral allele is a non-target is skewed to the non-target allele (Supplementary Figure 2, Supplementary Table 2). In summary, the distribution of allele frequencies shows evidence of selection against microRNA target sites in human populations. Again, the results are consistent when taking into account alternative polyadenylation in blood and kidney (Supplementary Table 2; Supplementary Figure 2).

Wobble pairing has been also observed in microRNA/transcript interactions. Under our hypothesis, selection against target sites will be weaker at sites in which the non-target allele is a potential target with a wobble pairing (GU). Hence, we partitioned the dataset of targets in those whose near-target allele is a wobble paired position and those which are not. We observed that for near-target-wobble the shift to the non-target allele is not significant (0.682) whilst for the non-wobble there was a significant shift (0.009, Supplementary Figure 3). These results furthers strengths the evidence for selection against microRNA target sites.

The effect of microRNA expression levels and evolutionary conservation

We next considered the potential impact of microRNA conservation. On the one hand, evolutionarily conserved microRNAs may have a weaker effect on selection against microRNA targets, as partly deleterious target alleles may have been cleared from the population. On the other hand, evolutionarily conserved microRNAs tend to be highly expressed, and therefore it is expected that the selective pressure to avoid targets for such microRNAs should be stronger. In other words, expression and conservation are not independent to each other. We therefore included both factors in the analysis: the level of expression of the microRNA and a measure of the phylogenetic conservation of the microRNA sequence.

We analysed cases where the derived allele is a target site (as in the previous section); testing whether frequency spectra were different across different levels of microRNA conservation (humanprimate specific, conserved in mammals, and conserved in animals) and different levels of expression (low, mid and high as described in Methods). To evaluate the joint effect of conservation and expression, we build a linear model of ranked independent variables with interactions (this is equivalent to the Scheirer-Ray-Hare test ((Sokal and Rohlf 1995), pp.445; see Methods). The interaction term was not statistically relevant in any of the models (Supplementary Table 3). In general, from the fitted model it is evident that expression has a significant impact in the microRNA selective avoidance whilst conservation has no detectable impact, once the expression level has been taken into account (Figure 3, Supplementary Table 3).

Figure 3. The effect of microRNA conservation and expression level on the frequency of target alleles.

Rank differences in our linear model (see Methods) explained by expression (solid lines) and conservation (dashed lines) levels for ten different tissues. Highly expressed and conserved in animals microRNAs were taken as the intercept of the model (rank zero).

More specifically, in Figure 2 the contribution of expression and conservation to the ranks in the linear model are plotted for all ten tissues. The smaller the rank, the greater the shift to the ancestral non-target allele in a comparison of derived allele frequencies. Overall, we observe that as the expression level increases the rank decreases, whilst the rank is similar for all three conservation levels (Figure 2). The effect is particularly clear in lung, liver and kidney. For brain ans breast we did not find such an association (Figure 2; Supplementary Table 3). From these analyses we concluded that expression level, rather than microRNA evolutionary conservation, determines the selective pressure against microRNA target sites.

Population differentiation at target sites

If a microRNA target site is under selective constraints, we should expect differentiation among populations at these sites to be relatively low. To investigate this prediction, we grouped SNPs at target sites for highly expressed microRNAs according to their Fst and compared the relative frequency of these SNPs compared to the background (see Methods). However, in Figure 3A we observed an enrichment in high Fst values. To further explore the relative contribution of microRNA target gains and losses during population differentiation we split the dataset in two groups, depending on whether the target allele was ancestral or derived. Strikingly, polymorphic microRNA target sites where the ancestral allele is the target show high levels of population differentiation (Figure 3A, red line). In contrast, for novel microRNA target sites there is a deficit of high Fst SNPs compared to the background expectations (Figure 4A, blue line). When we repeated the analysis for moderately expressed microRNA we found a similar pattern (Figure 4B). This may reflect that positive selection driving the generation of novel microRNA target sites is negligible. Also, evolution by microRNA target site loss seems important in human populations. In Table 1 we show SNPs in microRNA target sites with a Fst greater than 0.6. As suggested in Figure 4A and 4B, in a majority of target sites with a high degree of population differentiation the target allele was ancestral (21 out of 35), many being probably target losses in out-of-Africa populations (12 out of 21). Predicted target sites for microRNAs with no detectable expression level did not show any significant level of population differentiation (Figure 4C).

Figure 4. Fst enrichment in polymorphic microRNA target sites.

Comparisons of the Fst values for SNPs within microRNA target sites with control sites (background sites, see Methods). For each range of Fst values, the proportion of sites in that range was calculated for the microRNA target sites (P_t) and for the background sites (P_b). The value plotted is the log(P_t/P_b). (A) All targets sites for highly expressed microRNAs (black, dashed line); target sites where the target allele is ancestral (red) and targets sites where the target site is not ancestral (blue). (B and C) As in (A) but for moderately expressed and zero expressed microRNAs respectivelly. (D) Regression between absolute latitude in samples human populations and the target allele frequency associated to SNP rs2470102.

View this table:

Table 1.

Target allele frequencies at polymorphic sites with a high Fst values.

One of the SNPs in Table 1 has been recently associated with skin colour variation in Indian populations: rs2470102 (Sarkar and Nandineni 2018). Interestingly, this is in a predicted a target site for miR-1180-3p in MYEF2. This gene has been associated with skin colour as well, although the functional relationship is not clear (Mishra et al. 2017). We investigated a potential relationship between absolute latitude (distance to the Ecuador) and the frequency of the target allele and we found a strong association (Figure 4D; p=0.0015, R-squared adjusted = 0.4223). Given that the microRNA itself originated in primates (Arcila et al. 2014), and that the target allele is ancestral and conserved, the emergence of a new microRNA may have imposed a selective pressure to loss the target site in populations with less exposure to UV light. However interesting, these association remains speculative at the moment, and demographic differences between populations may also have an impact in this observed association.

These results further suggests that loss rather than conservation of targets may be more frequent in population dynamics. Thus, we revisited the analysis of derived allele frequencies in Figure 2 and considered this time target sites where the ancestral allele was a target site. If there is a detectable selective pressure to maintain target sites we will expect the derived allele frequency to be biased towards the target (ancestral) allele. In Figure 5 we plot the bias (as the D-statistic in a Kolmogorov-Smirnov test) and the significance (as the log10 of the p-value) and we observe that in this set the bias was not significant (blue dots). On the contrary, if we compare these results with those obtained for non-targets as ancestral sequences we observe a clear and significant biased towards the non-target allele (Figure 5).

Figure 5.

Allele preference for ancestral targets and non-targets. The shift of the distribution of derived allele frequencies is measured as the D statistic form the Kolmogorov-Smirnov test (x-axis). When the distribution is biased towards the derived allele the D statistic is negative (-D^-), and if the bias is to the ancestral allele the D value is D⁺. The y-axis plots the −log10 of the p-value obtained from the corresponding one-tailed KS test. For each tissue the graph represents the DAF bias when the ancestral allele is a target (blue circles) and when it is a non-target (red circles).

In one case we found a microRNA with a SNP within its seed sequence (the region that determines the targeting property of the microRNA), which shows some evidence of population differentiation (rs7210937; Fst = 0.3314). In this case, the Fst value between African and European populations is remarkably high (Fst = 0.6129; Nei’s estimate (Nei 1986; Bhatia et al. 2013)). In European populations 92.5% of sequenced individuals present the ancestral form of miR-1269b, whilst in African population, the derived version is more frequent (59.8%). As a shift in the seed sequence may have an impact on the evolution of 3’UTRs, we further studied target sites whose ancestral form is a target for the derived miR-1269 microRNA (344 in total). Then we compared the frequency of the target site allele between European and African populations. We found that in African populations the frequency of target alleles is lower than in European populations at these sites (Wilcoxon non-parametric test for paired samples; p=0.021) whilst for the ancestral miR-1269b we did not find any significant difference. These results suggest that a shift in the allele frequencies affecting the seed sequence of a microRNA can have an effect on the allele frequencies at the novel target sites, specifically toward the non-target allele.

DISCUSSION

The study of allele frequencies has been extensively used to detect selective pressures in human populations (Chen and Rajewsky 2006; Barreiro et al. 2008; Enard et al. 2014). Here we show that the patterns of allele frequencies at 3’UTRs show evidence of selection against most microRNA target sites. First, the allele frequencies at target sites are biased towards the non-target allele when the derived allele is a target sequence. These effects are strongest in the cases where the corresponding microRNAs are highly expressed, suggesting that interaction between the microRNA and the target is a key is the source of selection against target sequences. The microRNAs that have been conserved over longer periods of vertebrate evolution did not impose detectably greater selection against their target sequences, once the effect of expression levels had been taken into account. On the other hand we failed to detect any noticeable effect when the target sites was ancestral, and therefore we did not find statistical signal supporting selection to maintain existing microRNA target sites. We did not believe that means that there is no purifying selection to keep functional sites, but that this is comparably small compare to selection to avoid new, potentially deleterious, microRNA target sites. As a matter of fact, novel microRNA target sites has been identified in various studies as disease causing mutations (Abelson et al. 2005; Dusl et al. 2015). The most popular microRNA target prediction programs rely on target site conservation to reduce the number of false positives (Agarwal et al. 2015) and/or do not provide a stand-alone version to run on custom datasets (Paraskevopoulou et al. 2013). Therefore, we used a naive microRNA target prediction method that reports canonical targets and near-target sites (Marco 2018b). That allowed us to study pairs of alleles segregating at target sites without any other constraint. On the other hand, we would expect a high number of false positive in target predictions (reviewed in (Alexiou et al. 2009)). Remarkably, we found a significant pattern of selection against microRNA target sites. This reinforces our initial hypothesis and suggest that, if we would be able to restrict the analysis to bona fide target sites, the signal might be stronger. One possibility is to evaluate experimentally validated target sites. However, these experiments are based on reference genomes, so segregating target sites whose target allele is not in the reference genome will be lost from the analysis. The way forward may be to perform high-thoughtput microRNA target experiments, like HITS-CLIP (Chi et al. 2009), in cells derived from different populations. The continuous drop in the costs of sequencing and high-throughput experiments may allow this in the near future. Indeed, high-throughput experimental evaluation of segregating alleles at regulatory motifs (transcription factor binding sites, RNA binding sites, etcetera) is a promising area of research which will help us to move from a typological (reference genome) to a population view of gene regulation.

Another way to study the effect of selection in populations is to evaluate the population differentiation (Coop et al. 2009; Li et al. 2012). We found that, at microRNA target sites in general, there is an enrichment in high population differentiation. This result was observed by by Li et al. (Li et al. 2012). However, we found that this trend only holds for those sites at which the ancestral allele was a microRNA target site and the derived allele is a non-target. The loss of a microRNA target (as in the examples reported in a previous work (Li et al. 2012)) may be relatively frequent. It follows that the non-target allele might be neutral or even advantageous in some of these cases. It is noticeable that in the examples in which the derived allele reaches a high frequency, that occurs in the non-African populations, which is the pattern that would be expected if a neutral derived allele spread by genetic drift during founder events. Loss of target sites could also have advantageous effects, though the complex interactions that occur in regulatory networks. For example, it has been proposed that in the human lineage the loss of microRNA target sites contributed to an increase in the expression levels of some genes (Gardner and Vinther 2008). We also recently reported the loss of multiple target sites in Out-of-African populations (Helmy et al. 2019). Our work suggest that this loss of targets may be continuing now in human populations. Selection in favour of new target sites appears to be rarer: we found a strong signal of purifying selection against novel microRNA target sites.

It is worth mentioning that our result as well as other works are based on precomputed Fst values that may be affected by systematic biases in allele imputation. As more genome sequences of high quality are released, and new Fst estimations become available, it will be important to re-evaluate the impact of population differentiation in microRNA target sites. In general, our results also depend on the ability to predict microRNA target sites from primary sequences. As the number of polymorphic microRNA target sites experimentally validated is very limited, at the moment our approach is the only possible. However, it’ll be important to re-evaluate our results in due course as more datasets become available. Also, our statistical analysis relies on two types of null models: microRNAs with not detected expression, and the analysis of the reverse complement sequence of 3’UTRs. It is possible that microRNAs expressed in only a few cells are wrongly attributed to the ‘null’ class. Likewise, the reverse complement of 3’UTR could also be potentially transcribed (antisense transcripts) and, hence, wrongly selected as ‘null’ class. In any case, the use of complementary null models leading to similar results supports our hypothesis, and indicates that the use of better annotated datasets in the future may even increase the statistical evidence in favour of selection against microRNA target sites.

Our results suggest that new microRNA target site generating mutations (the derive allele is the target) are selected against. This is a case of the classic selection-mutation balance in which the mutations are deleterious. Selection against deleterious mutations has been extensively studied in population genetics (reviewed in (Charlesworth 2012)). For instance, strong purifying selection produces a phenomenon called background selection, in which loci linked to the selected site experience a reduction in their effective population size (Charlesworth et al. 1993). That is, purifying selection reduces the influence of selection at linked sites. For weakly selected sites, a similar process has been described: weak selection Hill-Robertson interference (wsHR (Hill and Robertson 1966; McVean and Charlesworth 2000)). Under wsHR, multiple alleles are under a weak selective force, very close together so that recombination is small or negligible between sites, interfering with other selective pressures in the area. Multiple weakly deleterious mutation at transcription factor binding sites have been reported indeed (Abecasis et al. 2012). We believe that this is the case for the selection against microRNA target sites here described: weak selection against multiple target/near-target sites will shape the evolutionary landscape of the entire untranslated region.

Our study suggest that the mutation rate in humans may be high enough to produce a significant selective pressure against novel microRNA target sites. New target sites will emerge at a significant rate because many mutations can potentially introduce a new site for one of the many microRNAs. More specifically, there are about 2,000 microRNA families described in TargetScan (see Methods), defined by 7-nucleotide seed sequences. Assuming that 3’UTRs are composed of non-overlapping 7-mers (a simplifying yet conservative assumption) the expected number of near-target sites per kilobase (kb) is about 75. With a mutation rate of 2.5 10^-5 per kb (Nachman and Crowell 2000) and a total length of the genome that encode 3’UTRs of 34Mb, it can be shown that there will be on average one novel microRNA target site on a 3’UTR per genome per generation. That is, one potential deleterious microRNA target site per person per generation.

It is expected that other regulatory motifs influence the evolution of 3’UTRs. For instance, Savisaar et al (Savisaar and Hurst 2017) have described selection against RNA-binding motifs. The selective avoidance of transcription factor binding sites (Hahn et al. 2003; Babbitt 2010) and of mRNA/ncRNA regulatory interactions in bacteria (Umu et al. 2016) have been also described. These works are based on comparative genomics between different species and not on variation within populations. However, some theoretical models take into account the selective pressure against regulatory motifs (Berg et al. 2004; Stewart et al. 2012). It is likely that on top of all the selective forces that are usually taken into account, there is a layer of selection against weakly deleterious regulatory motifs that will be influencing the evolution of the genome. In conclusion, selection against microRNA target sites in prevalent in human populations, and it may constrain other selective forces in post-transcriptional regulatory regions.

METHODS

MicroRNA mature sequences were downloaded from miRBase v.22, considering only sequences annotated with ‘high-confidence’ (Kozomara et al. 2018). MicroRNA target and near-target sites were predicted with seedVicious (v.1.1 (Marco 2018b)]) against 3’UTR as annotated in Ensembl version 96 (Kinsella et al. 2011) for the human genome assembly hg38. Single nucleotide polymorphisms for the 1000 Genomes project (1000 Genomes Project Consortium et al. 2015) were retrieved from dbSNP (build 151) (Sherry et al. 2001) and mapped to our target predictions. Ensembl sequences and polymorphism data were downloaded using the BiomaRt R package (Durinck et al. 2009). By mapping SNPs to targets and to near-targets (as described in (Marco 2018b)) we are able to identify pairs of alleles in which only one of the allele is a target site, so allele frequencies can be computed as target allele frequencies (Hatlen and Marco 2019). When plotting allele frequencies (Figure 2) we only considered segregating alleles in which the minor allele is present in at least 1% of the sampled population, as reported dbSNP (see (Hatlen et al. 2019) for details). The ancestral allele status was also retrieved from the pre-computed values available from dbSNP. To compute the background (randomly expected) allele distributions we repeated the process but finding targets in the reverse complement strand of the 3’UTR, to control for sequence length and composition. In the analysis of allele frequencies we used a second microRNA target prediction program based on a different principle, miRanda (Enright et al. 2003), which takes into account the binding energy of the RNA:RNA duplex, and we use the default parameters. About 25% of polymorphic canonical target sites were also predicted as targets by miRanda.

Expression information for microRNAs was obtained our own database PopTargs (https://poptargs.essex.ac.uk; Hatlen and Marco 2019). In summary, high-throughput experiments from Meunier et al. (Meunier et al. 2013) and miRMine (Panwar et al. 2017) were downloaded, reads were mapped to miRBase mature microRNA sequences and the average reads per million for each microRNAs was consider to measure the expression level in a tissue. Coding genes were considered to be expressed in a given tissue as pre-computed in the Bgee database (‘gold’ set, version 14, (Bastian et al. 2008)). We studied the following tissues: lung, blood, placenta, liver, heart, brain, kidney, cerebellum, breast and testis. Importantly, most microRNA and mRNA extractions were done in the same commercial samples by the same lab (Brawand et al. 2011; Meunier et al. 2013), which suggest that co-expressed microRNA/mRNA pairs were present in the same tissue. MicroRNAs with more than 50 RPM (reads per million) were considered moderately expressed, and those with over 500 RPM were labelled as highly expressed. For transcripts we did not make distinctions between different expression levels, as transcript levels are affected by the action of microRNAs (Nam et al. 2014). Co-expressed microRNA:gene pairs were those whose microRNA was moderately or highly expressed in a tissue (depending on the analysis described) and the transcript targeted is detected in the same tissue. For ‘blood’ and ‘liver’ tissues we perform additional analysis considering the tissue specific polyadenilation signals as reported in APADB version 2 (Müller et al. 2014). MicroRNAs were grouped into evolutionary conservation categories depending on the species spread (primates, mammals and animals) of the seed family in TargetScan 7.2 (Agarwal et al. 2015).

Fst values were retrieved from the 1,000 Genomes Selection Browser 1.0 (Pybus et al. 2014). In Figure 4 the fold enrichment was computed as the logarithm of the ratio between the proportion of SNPs at target sites and the proportion of SNPs at a background site for each Fst bin. The latitude of specific populations was computed as in (Helmy et al. 2019). First, the geographical locations of the studied populations were obtained from the sample descriptions at https://www.coriell.org/1/NHGRI/Collections/1000-Genomes-Collections/1000-Genomes-Project., using the location of the capital of the country in cases where the location was ambiguous or nonspecific. For multiple collection points we computed the centroid middle point. Populations that migrated in recent times were discarded (CHD, CEU, ASW, ACB, GIH, STU and ITU as described at IGSR: The International Genome Sample Resource (http://www.internationalgenome.org/category/population/)).

All statistical analyses were done with R (v. 3.4.3, (R Development Core Team 2009)). All processed datasets and online tools to compute the tests here reported are available at our dedicated web server PopTargs (https://poptargs.essex.ac.uk; Hatlen and Marco, 2019). All the code use for the generation of the results, figures and tables is freely available from FigShare at https://doi.org/10.6084/m9.figshare.9539645.

SUPPORTING INFORMATION LEGENDS

Supplementary Figure 1. Derived (target) allele frequency distributions for ten different tissues, compared with the expected distribution.

Supplementary Figure 2. Derived (target) allele frequency distributions for ten different tissues for highly expressed microRNAs, compared with the distribution for microRNAs with no detectable expression level.

Supplementary Figure 3. Derived (target) allele frequency distributions for wobble and non-wobble non-targets.

Supplementary Table 1. Statistical comparison of the derived (target) allele distributions at potential target sites versus background (expected) for 10 different tissues.

Supplementary Table 2. Statistical comparison of the derived (target) allele distributions at potential target sites of highly-expressed microRNAs versus target sites for non-detected (zero expressed) microRNAs for 10 different tissues.

Supplementary Table 3. P-values for the statistical support of effect of dependent variables and interaction in two linear models (see main text).

ACKNOWLEDGEMENTS

We thank Richard Nichols and Ben Skinner for critical reading of the manuscript and useful comments, and an editor and two anonymous reviewers for constructive criticisms on the methodology. We also acknowledge the use of the High Performance Computing Facility (Ceres) and its associated support services at the University of Essex in the completion of this work. This research was supported by the Wellcome Trust [grant number 200585/Z/16/Z].

Footnotes

Minos changes (typos) and additional discussion on limitations of the study.
https://doi.org/10.6084/m9.figshare.9539645

REFERENCES

↵
1000 Genomes Project Consortium, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, et al. 2015. A global reference for human genetic variation. Nature 526:68–74.
OpenUrl CrossRef PubMed
↵
Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA. 2012. An integrated map of genetic variation from 1,092 human genomes. Nature 491:56–65.
OpenUrl CrossRef PubMed Web of Science
↵
Abelson JF, Kwan KY, O’Roak BJ, Baek DY, Stillman AA, Morgan TM, Mathews CA, Pauls DL, Rašin M-R, Gunel M, et al. 2005. Sequence Variants in SLITRK1 Are Associated with Tourette’s Syndrome. Science 310:317–320.
OpenUrl Abstract/FREE Full Text
↵
Agarwal V, Bell GW, Nam J-W, Bartel DP. 2015. Predicting effective microRNA target sites in mammalian mRNAs. eLife 4.
↵
Alexiou P, Maragkakis M, Papadopoulos GL, Reczko M, Hatzigeorgiou AG. 2009. Lost in translation: an assessment and perspective for computational microRNA target identification. Bioinformatics 25:3049–3055.
OpenUrl CrossRef PubMed Web of Science
↵
Arcila ML, Betizeau M, Cambronne XA, Guzman E, Doerflinger N, Bouhallier F, Zhou H, Wu B, Rani N, Bassett DS, et al. 2014. Novel primate miRNAs co-evolved with ancient target genes in germinal zone specific expression patterns. Neuron 81:1255–1262.
OpenUrl CrossRef PubMed Web of Science
↵
Babbitt GA. 2010. Relaxed selection against accidental binding of transcription factors with conserved chromatin contexts. Gene 466:43–48.
OpenUrl CrossRef PubMed Web of Science
↵
Barreiro LB, Laval G, Quach H, Patin E, Quintana-Murci L. 2008. Natural selection has driven population differentiation in modern humans. Nat. Genet. 40:340–345.
OpenUrl CrossRef PubMed Web of Science
↵
Bartel DP, Chen C-Z. 2004. Micromanagers of gene expression: the potentially widespread influence of metazoan microRNAs. Nat. Rev. Genet. 5:396–400.
OpenUrl CrossRef PubMed Web of Science
↵
Bastian F, Parmentier G, Roux J, Moretti S, Laudet V, Robinson-Rechavi M. 2008. Bgee: Integrating and Comparing Heterogeneous Transcriptome Data Among Species. In: Data Integration in the Life Sciences. Lecture Notes in Computer Science. Springer, Berlin, Heidelberg. p. 124–131. Available from: https://link.springer.com/chapter/10.1007/978-3-540-69828-9_12
↵
Berg J, Willmann S, LÃ¤ssig M. 2004. Adaptive evolution of transcription factor binding sites. BMC Evol Biol [Internet] 4. Available from: http://dx.doi.org/10.1186/1471-2148-4-42
↵
Bhatia G, Patterson N, Sankararaman S, Price AL. 2013. Estimating and interpreting FST: The impact of rare variants. Genome Res. 23:1514–1521.
OpenUrl Abstract/FREE Full Text
↵
Brawand D, Soumillon M, Necsulea A, Julien P, Csardi G, Harrigan P, Weier M, Liechti A, Aximu-Petri A, Kircher M, et al. 2011. The evolution of gene expression levels in mammalian organs. Nature 478:343–348.
OpenUrl CrossRef PubMed Web of Science
↵
Charlesworth B. 2012. The Effects of Deleterious Mutations on Evolution at Linked Sites. Genetics 190:5–22.
OpenUrl Abstract/FREE Full Text
↵
Charlesworth B, Morgan MT, Charlesworth D. 1993. The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289–1303.
OpenUrl Abstract/FREE Full Text
↵
Chen K, Rajewsky N. 2006. Natural selection on human microRNA binding sites inferred from SNP data. Nat. Genet. 38:1452–1456.
OpenUrl CrossRef PubMed Web of Science
↵
Chi SW, Zang JB, Mele A, Darnell RB. 2009. Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps. Nature 460:479–486.
OpenUrl CrossRef PubMed Web of Science
↵
Coop G, Pickrell JK, Novembre J, Kudaravalli S, Li J, Absher D, Myers RM, Cavalli-Sforza LL, Feldman MW, Pritchard JK. 2009. The Role of Geography in Human Adaptation. PLOS Genet. 5:e1000500.
OpenUrl CrossRef PubMed
↵
Durinck S, Spellman PT, Birney E, Huber W. 2009. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4:1184.
OpenUrl CrossRef PubMed Web of Science
↵
Dusl M, Senderek J, Müller JS, Vogel JG, Pertl A, Stucka R, Lochmüller H, David R, Abicht A. 2015. A 3′-UTR mutation creates a microRNA target site in the GFPT1 gene of patients with congenital myasthenic syndrome. Hum. Mol. Genet. 24:3418–3426.
OpenUrl CrossRef PubMed
↵
Enard D, Messer PW, Petrov DA. 2014. Genome-wide signals of positive selection in human evolution. Genome Res. 24:885–895.
OpenUrl Abstract/FREE Full Text
↵
Enright A, John B, Gaul U, Tuschl T, Sander C, Marks D. 2003. MicroRNA targets in Drosophila. Genome Biol. 5:R1.
OpenUrl CrossRef PubMed
↵
Farh KK-H, Grimson A, Jan C, Lewis BP, Johnston WK, Lim LP, Burge CB, Bartel DP. 2005. The widespread impact of mammalian MicroRNAs on mRNA repression and evolution. Science 310:1817–1821.
OpenUrl Abstract/FREE Full Text
↵
Gardner PP, Vinther J. 2008. Mutation of miRNA target sequences during human evolution. Trends Genet. TIG 24:262–265.
OpenUrl
↵
Grün D, Wang Y-L, Langenberger D, Gunsalus KC, Rajewsky N. 2005. microRNA Target Predictions across Seven Drosophila Species and Comparison to Mammalian Targets. PLOS Comput. Biol. 1:e13.
OpenUrl CrossRef PubMed
↵
Hahn M, Stajich J, Wray G. 2003. The effects of selection against spurious transcription factor binding sites. Mol Biol Evol 20:906, 901.
OpenUrl
↵
Hatlen A, Helmy M, Marco A. 2019. PopTargs: a database for studying population evolutionary genetics of human microRNA target sites. Database [Internet] 2019. Available from: https://academic.oup.com/database/article/doi/10.1093/database/baz102/5585574
↵
Helmy M, Hatlen A, Marco A. 2019. The Impact of Population Variation in the Analysis of microRNA Target Sites. Non-Coding RNA 5:42.
OpenUrl
↵
Hill WG, Robertson A. 1966. The effect of linkage on limits to artificial selection. Genet. Res. 8:269–294.
OpenUrl CrossRef PubMed Web of Science
↵
Hui JHL, Marco A, Hunt S, Melling J, Griffiths-Jones S, Ronshaugen M. 2013. Structure, evolution and function of the bi-directionally transcribed iab-4/iab-8 microRNA locus in arthropods. Nucleic Acids Res. 41:3352–3361.
OpenUrl CrossRef PubMed Web of Science
↵
Kinsella RJ, Kähäri A, Haider S, Zamora J, Proctor G, Spudich G, Almeida-King J, Staines D, Derwent P, Kerhornou A, et al. 2011. Ensembl BioMarts: a hub for data retrieval across taxonomic space. Database J. Biol. Databases Curation 2011:bar030.
OpenUrl
↵
Kozomara A, Birgaoanu M, Griffiths-Jones S. 2018. miRBase: from microRNA sequences to function. Nucleic Acids Res.
↵
Kozomara A, Griffiths-Jones S. 2014. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 42:D68–73.
OpenUrl CrossRef PubMed Web of Science
↵
Lagos-Quintana M, Rauhut R, Lendeckel W, Tuschl T. 2001. Identification of Novel Genes Coding for Small Expressed RNAs. Science 294:853–858.
OpenUrl Abstract/FREE Full Text
↵
Lau NC, Lim LP, Weinstein EG, Bartel DP. 2001. An Abundant Class of Tiny RNAs with Probable Regulatory Roles in Caenorhabditis elegans. Science 294:858–862.
OpenUrl Abstract/FREE Full Text
↵
Lee RC, Ambros V. 2001. An extensive class of small RNAs in Caenorhabditis elegans. Science 294:862–864.
OpenUrl Abstract/FREE Full Text
↵
Lewis BP, Burge CB, Bartel DP. 2005. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 120:15–20.
OpenUrl CrossRef PubMed Web of Science
↵
Lewis BP, Shih I, Jones-Rhoades MW, Bartel DP, Burge CB. 2003. Prediction of mammalian microRNA targets. Cell 115:787–798.
OpenUrl CrossRef PubMed Web of Science
↵
Li J, Liu Y, Xin X, Kim TS, Cabeza EA, Ren J, Nielsen R, Wrana JL, Zhang Z. 2012. Evidence for Positive Selection on a Number of MicroRNA Regulatory Interactions during Recent Human Evolution. PLoS Genet 8:e1002578.
OpenUrl CrossRef PubMed
↵
Marco A. 2015. Selection Against Maternal microRNA Target Sites in Maternal Transcripts. G3 GenesGenomesGenetics:g3.115.019497.
↵
Marco A. 2018a. No evidence of functional co-adaptation between clustered microRNAs. bioRxiv:274811.
↵
Marco A. 2018b. SeedVicious: Analysis of microRNA target and near-target sites. PLOS ONE 13:e0195532.
OpenUrl CrossRef
↵
McVean GAT, Charlesworth B. 2000. The Effects of Hill-Robertson Interference Between Weakly Selected Mutations on Patterns of Molecular Evolution and Variation. Genetics 155:929–944.
OpenUrl Abstract/FREE Full Text
↵
Meunier J, Lemoine F, Soumillon M, Liechti A, Weier M, Guschanski K, Hu H, Khaitovich P, Kaessmann H. 2013. Birth and expression evolution of mammalian microRNA genes. Genome Res. 23:34–45.
OpenUrl Abstract/FREE Full Text
↵
Mishra Anshuman, Nizammuddin S, Mallick CB, Singh S, Prakash S, Siddiqui NA, Rai N, Carlus SJ, Sudhakar DVS, Tripathi VP, et al. 2017. Genotype-Phenotype Study of the Middle Gangetic Plain in India Shows Association of rs2470102 with Skin Pigmentation. J. Invest. Dermatol. 137:670–677.
OpenUrl
↵
Müller S, Rycak L, Afonso-Grunz F, Winter P, Zawada AM, Damrath E, Scheider J, Schmäh J, Koch I, Kahl G, et al. 2014. APADB: a database for alternative polyadenylation and microRNA regulation events. Database J. Biol. Databases Curation [Internet] 2014. Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4105710/
↵
Nachman MW, Crowell SL. 2000. Estimate of the Mutation Rate per Nucleotide in Humans. Genetics 156:297–304.
OpenUrl Abstract/FREE Full Text
↵
Nam J-W, Rissland OS, Koppstein D, Abreu-Goodger C, Jan CH, Agarwal V, Yildirim MA, Rodriguez A, Bartel DP. 2014. Global analyses of the effect of different cellular contexts on microRNA targeting. Mol. Cell 53:1031–1043.
OpenUrl CrossRef PubMed Web of Science
↵
Nei M. 1986. Definition and estimation of fixation indices. Evol. Int. J. Org. Evol. 40:643–645.
OpenUrl CrossRef
↵
Panwar B, Omenn GS, Guan Y. 2017. miRmine: a database of human miRNA expression profiles. Bioinformatics 33:1554–1560.
OpenUrl
↵
Paraskevopoulou MD, Georgakilas G, Kostoulas N, Vlachos IS, Vergoulis T, Reczko M, Filippidis C, Dalamagas T, Hatzigeorgiou AG. 2013. DIANA-microT web server v5.0: service integration into miRNA functional analysis workflows. Nucleic Acids Res. 41:W169–W173.
OpenUrl CrossRef PubMed Web of Science
↵
Pasquinelli AE, Reinhart BJ, Slack F, Martindale MQ, Kuroda MI, Maller B, Hayward DC, Ball EE, Degnan B, Müller P, et al. 2000. Conservation of the sequence and temporal expression of let-7 heterochronic regulatory RNA. Nature 408:86–89.
OpenUrl CrossRef PubMed Web of Science
↵
Pybus M, Dall’Olio GM, Luisi P, Uzkudun M, Carreño-Torres A, Pavlidis P, Laayouni H, Bertranpetit J, Engelken J. 2014. 1000 Genomes Selection Browser 1.0: a genome browser dedicated to signatures of natural selection in modern humans. Nucleic Acids Res. 42:D903–9.
OpenUrl CrossRef PubMed Web of Science
↵
R Development Core Team. 2009. {R: A language and environment for statistical computing}. Vienna, Austria: R Foundation for Statistical Computing
↵
Sarkar A, Nandineni MR. 2018. Association of common genetic variants with human skin color variation in Indian populations. Am. J. Hum. Biol. 30:e23068.
OpenUrl
↵
Saunders MA, Liang H, Li W-H. 2007. Human polymorphism at microRNAs and microRNA target sites. Proc. Natl. Acad. Sci. U. S. A. 104:3300–3305.
OpenUrl Abstract/FREE Full Text
↵
Savisaar R, Hurst LD. 2017. Both Maintenance and Avoidance of RNA-Binding Protein Interactions Constrain Coding Sequence Evolution. Mol. Biol. Evol. 34:1110–1126.
OpenUrl
↵
Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. 2001. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 29:308–311.
OpenUrl CrossRef PubMed Web of Science
↵
Sokal RR, Rohlf FJ. 1995. Biometry: The Principles and Practice of Statistics in Biological Research. Freeman
↵
Sood P, Krek A, Zavolan M, Macino G, Rajewsky N. 2006. Cell-type-specific signatures of microRNAs on target mRNA expression. Proc. Natl. Acad. Sci. U. S. A. 103:2746–2751.
OpenUrl Abstract/FREE Full Text
↵
Stark A, Brennecke J, Bushati N, Russell RB, Cohen SM. 2005. Animal MicroRNAs confer robustness to gene expression and have a significant impact on 3’UTR evolution. Cell 123:1133–1146.
OpenUrl CrossRef PubMed Web of Science
↵
Stark A, Brennecke J, Russell RB, Cohen SM. 2003. Identification of Drosophila MicroRNA targets. PLoS Biol. 1:E60.
OpenUrl CrossRef PubMed
↵
Stewart AJ, Hannenhalli S, Plotkin JB. 2012. Why transcription factor binding sites are ten nucleotides long. Genetics 192:973–985.
OpenUrl Abstract/FREE Full Text
↵
Umu SU, Poole AM, Dobson RC, Gardner PP. 2016. Avoidance of stochastic RNA interactions can be harnessed to control protein expression levels in bacteria and archaea. eLife [Internet]. Available from: https://elifesciences.org/articles/13479
↵
Zhou L, Lim MYT, Kaur P, Saj A, Bortolamiol-Becet D, Gopal V, Tolwinski N, Tucker-Kellogg G, Okamura K. 2018. Importance of miRNA stability and alternative primary miRNA isoforms in gene regulation during Drosophila development. eLife [Internet]. Available from: https://elifesciences.org/articles/38389

View the discussion thread.

Posted June 09, 2020.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Evolutionary Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11718)
Bioengineering (8724)
Bioinformatics (29132)
Biophysics (14936)
Cancer Biology (12051)
Cell Biology (17360)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14146)
Epidemiology (2067)
Evolutionary Biology (18269)
Genetics (12223)
Genomics (16768)
Immunology (11844)
Microbiology (28016)
Molecular Biology (11560)
Neuroscience (60822)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10401)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] ↵
1000 Genomes Project Consortium, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, et al. 2015. A global reference for human genetic variation. Nature 526:68–74.
OpenUrl CrossRef PubMed

[2] ↵
Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA. 2012. An integrated map of genetic variation from 1,092 human genomes. Nature 491:56–65.
OpenUrl CrossRef PubMed Web of Science

[3] ↵
Abelson JF, Kwan KY, O’Roak BJ, Baek DY, Stillman AA, Morgan TM, Mathews CA, Pauls DL, Rašin M-R, Gunel M, et al. 2005. Sequence Variants in SLITRK1 Are Associated with Tourette’s Syndrome. Science 310:317–320.
OpenUrl Abstract/FREE Full Text

[4] ↵
Agarwal V, Bell GW, Nam J-W, Bartel DP. 2015. Predicting effective microRNA target sites in mammalian mRNAs. eLife 4.

[5] ↵
Alexiou P, Maragkakis M, Papadopoulos GL, Reczko M, Hatzigeorgiou AG. 2009. Lost in translation: an assessment and perspective for computational microRNA target identification. Bioinformatics 25:3049–3055.
OpenUrl CrossRef PubMed Web of Science

[6] ↵
Arcila ML, Betizeau M, Cambronne XA, Guzman E, Doerflinger N, Bouhallier F, Zhou H, Wu B, Rani N, Bassett DS, et al. 2014. Novel primate miRNAs co-evolved with ancient target genes in germinal zone specific expression patterns. Neuron 81:1255–1262.
OpenUrl CrossRef PubMed Web of Science

[7] ↵
Babbitt GA. 2010. Relaxed selection against accidental binding of transcription factors with conserved chromatin contexts. Gene 466:43–48.
OpenUrl CrossRef PubMed Web of Science

[8] ↵
Barreiro LB, Laval G, Quach H, Patin E, Quintana-Murci L. 2008. Natural selection has driven population differentiation in modern humans. Nat. Genet. 40:340–345.
OpenUrl CrossRef PubMed Web of Science

[9] ↵
Bartel DP, Chen C-Z. 2004. Micromanagers of gene expression: the potentially widespread influence of metazoan microRNAs. Nat. Rev. Genet. 5:396–400.
OpenUrl CrossRef PubMed Web of Science

[10] ↵
Bastian F, Parmentier G, Roux J, Moretti S, Laudet V, Robinson-Rechavi M. 2008. Bgee: Integrating and Comparing Heterogeneous Transcriptome Data Among Species. In: Data Integration in the Life Sciences. Lecture Notes in Computer Science. Springer, Berlin, Heidelberg. p. 124–131. Available from: https://link.springer.com/chapter/10.1007/978-3-540-69828-9_12

[11] ↵
Berg J, Willmann S, LÃ¤ssig M. 2004. Adaptive evolution of transcription factor binding sites. BMC Evol Biol [Internet] 4. Available from: http://dx.doi.org/10.1186/1471-2148-4-42

[12] ↵
Bhatia G, Patterson N, Sankararaman S, Price AL. 2013. Estimating and interpreting FST: The impact of rare variants. Genome Res. 23:1514–1521.
OpenUrl Abstract/FREE Full Text

[13] ↵
Brawand D, Soumillon M, Necsulea A, Julien P, Csardi G, Harrigan P, Weier M, Liechti A, Aximu-Petri A, Kircher M, et al. 2011. The evolution of gene expression levels in mammalian organs. Nature 478:343–348.
OpenUrl CrossRef PubMed Web of Science

[14] ↵
Charlesworth B. 2012. The Effects of Deleterious Mutations on Evolution at Linked Sites. Genetics 190:5–22.
OpenUrl Abstract/FREE Full Text

[15] ↵
Charlesworth B, Morgan MT, Charlesworth D. 1993. The effect of deleterious mutations on neutral molecular variation. Genetics 134:1289–1303.
OpenUrl Abstract/FREE Full Text

[16] ↵
Chen K, Rajewsky N. 2006. Natural selection on human microRNA binding sites inferred from SNP data. Nat. Genet. 38:1452–1456.
OpenUrl CrossRef PubMed Web of Science

[17] ↵
Chi SW, Zang JB, Mele A, Darnell RB. 2009. Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps. Nature 460:479–486.
OpenUrl CrossRef PubMed Web of Science

[18] ↵
Coop G, Pickrell JK, Novembre J, Kudaravalli S, Li J, Absher D, Myers RM, Cavalli-Sforza LL, Feldman MW, Pritchard JK. 2009. The Role of Geography in Human Adaptation. PLOS Genet. 5:e1000500.
OpenUrl CrossRef PubMed

[19] ↵
Durinck S, Spellman PT, Birney E, Huber W. 2009. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4:1184.
OpenUrl CrossRef PubMed Web of Science

[20] ↵
Dusl M, Senderek J, Müller JS, Vogel JG, Pertl A, Stucka R, Lochmüller H, David R, Abicht A. 2015. A 3′-UTR mutation creates a microRNA target site in the GFPT1 gene of patients with congenital myasthenic syndrome. Hum. Mol. Genet. 24:3418–3426.
OpenUrl CrossRef PubMed

[21] ↵
Enard D, Messer PW, Petrov DA. 2014. Genome-wide signals of positive selection in human evolution. Genome Res. 24:885–895.
OpenUrl Abstract/FREE Full Text

[22] ↵
Enright A, John B, Gaul U, Tuschl T, Sander C, Marks D. 2003. MicroRNA targets in Drosophila. Genome Biol. 5:R1.
OpenUrl CrossRef PubMed

[23] ↵
Farh KK-H, Grimson A, Jan C, Lewis BP, Johnston WK, Lim LP, Burge CB, Bartel DP. 2005. The widespread impact of mammalian MicroRNAs on mRNA repression and evolution. Science 310:1817–1821.
OpenUrl Abstract/FREE Full Text

[24] ↵
Gardner PP, Vinther J. 2008. Mutation of miRNA target sequences during human evolution. Trends Genet. TIG 24:262–265.
OpenUrl

[25] ↵
Grün D, Wang Y-L, Langenberger D, Gunsalus KC, Rajewsky N. 2005. microRNA Target Predictions across Seven Drosophila Species and Comparison to Mammalian Targets. PLOS Comput. Biol. 1:e13.
OpenUrl CrossRef PubMed

[26] ↵
Hahn M, Stajich J, Wray G. 2003. The effects of selection against spurious transcription factor binding sites. Mol Biol Evol 20:906, 901.
OpenUrl

[27] ↵
Hatlen A, Helmy M, Marco A. 2019. PopTargs: a database for studying population evolutionary genetics of human microRNA target sites. Database [Internet] 2019. Available from: https://academic.oup.com/database/article/doi/10.1093/database/baz102/5585574

[28] ↵
Helmy M, Hatlen A, Marco A. 2019. The Impact of Population Variation in the Analysis of microRNA Target Sites. Non-Coding RNA 5:42.
OpenUrl

[29] ↵
Hill WG, Robertson A. 1966. The effect of linkage on limits to artificial selection. Genet. Res. 8:269–294.
OpenUrl CrossRef PubMed Web of Science

[30] ↵
Hui JHL, Marco A, Hunt S, Melling J, Griffiths-Jones S, Ronshaugen M. 2013. Structure, evolution and function of the bi-directionally transcribed iab-4/iab-8 microRNA locus in arthropods. Nucleic Acids Res. 41:3352–3361.
OpenUrl CrossRef PubMed Web of Science

[31] ↵
Kinsella RJ, Kähäri A, Haider S, Zamora J, Proctor G, Spudich G, Almeida-King J, Staines D, Derwent P, Kerhornou A, et al. 2011. Ensembl BioMarts: a hub for data retrieval across taxonomic space. Database J. Biol. Databases Curation 2011:bar030.
OpenUrl

[32] ↵
Kozomara A, Birgaoanu M, Griffiths-Jones S. 2018. miRBase: from microRNA sequences to function. Nucleic Acids Res.

[33] ↵
Kozomara A, Griffiths-Jones S. 2014. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 42:D68–73.
OpenUrl CrossRef PubMed Web of Science

[34] ↵
Lagos-Quintana M, Rauhut R, Lendeckel W, Tuschl T. 2001. Identification of Novel Genes Coding for Small Expressed RNAs. Science 294:853–858.
OpenUrl Abstract/FREE Full Text

[35] ↵
Lau NC, Lim LP, Weinstein EG, Bartel DP. 2001. An Abundant Class of Tiny RNAs with Probable Regulatory Roles in Caenorhabditis elegans. Science 294:858–862.
OpenUrl Abstract/FREE Full Text

[36] ↵
Lee RC, Ambros V. 2001. An extensive class of small RNAs in Caenorhabditis elegans. Science 294:862–864.
OpenUrl Abstract/FREE Full Text

[37] ↵
Lewis BP, Burge CB, Bartel DP. 2005. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 120:15–20.
OpenUrl CrossRef PubMed Web of Science

[38] ↵
Lewis BP, Shih I, Jones-Rhoades MW, Bartel DP, Burge CB. 2003. Prediction of mammalian microRNA targets. Cell 115:787–798.
OpenUrl CrossRef PubMed Web of Science

[39] ↵
Li J, Liu Y, Xin X, Kim TS, Cabeza EA, Ren J, Nielsen R, Wrana JL, Zhang Z. 2012. Evidence for Positive Selection on a Number of MicroRNA Regulatory Interactions during Recent Human Evolution. PLoS Genet 8:e1002578.
OpenUrl CrossRef PubMed

[40] ↵
Marco A. 2015. Selection Against Maternal microRNA Target Sites in Maternal Transcripts. G3 GenesGenomesGenetics:g3.115.019497.

[41] ↵
Marco A. 2018a. No evidence of functional co-adaptation between clustered microRNAs. bioRxiv:274811.

[42] ↵
Marco A. 2018b. SeedVicious: Analysis of microRNA target and near-target sites. PLOS ONE 13:e0195532.
OpenUrl CrossRef

[43] ↵
McVean GAT, Charlesworth B. 2000. The Effects of Hill-Robertson Interference Between Weakly Selected Mutations on Patterns of Molecular Evolution and Variation. Genetics 155:929–944.
OpenUrl Abstract/FREE Full Text

[44] ↵
Meunier J, Lemoine F, Soumillon M, Liechti A, Weier M, Guschanski K, Hu H, Khaitovich P, Kaessmann H. 2013. Birth and expression evolution of mammalian microRNA genes. Genome Res. 23:34–45.
OpenUrl Abstract/FREE Full Text

[45] ↵
Mishra Anshuman, Nizammuddin S, Mallick CB, Singh S, Prakash S, Siddiqui NA, Rai N, Carlus SJ, Sudhakar DVS, Tripathi VP, et al. 2017. Genotype-Phenotype Study of the Middle Gangetic Plain in India Shows Association of rs2470102 with Skin Pigmentation. J. Invest. Dermatol. 137:670–677.
OpenUrl

[46] ↵
Müller S, Rycak L, Afonso-Grunz F, Winter P, Zawada AM, Damrath E, Scheider J, Schmäh J, Koch I, Kahl G, et al. 2014. APADB: a database for alternative polyadenylation and microRNA regulation events. Database J. Biol. Databases Curation [Internet] 2014. Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4105710/

[47] ↵
Nachman MW, Crowell SL. 2000. Estimate of the Mutation Rate per Nucleotide in Humans. Genetics 156:297–304.
OpenUrl Abstract/FREE Full Text

[48] ↵
Nam J-W, Rissland OS, Koppstein D, Abreu-Goodger C, Jan CH, Agarwal V, Yildirim MA, Rodriguez A, Bartel DP. 2014. Global analyses of the effect of different cellular contexts on microRNA targeting. Mol. Cell 53:1031–1043.
OpenUrl CrossRef PubMed Web of Science

[49] ↵
Nei M. 1986. Definition and estimation of fixation indices. Evol. Int. J. Org. Evol. 40:643–645.
OpenUrl CrossRef

[50] ↵
Panwar B, Omenn GS, Guan Y. 2017. miRmine: a database of human miRNA expression profiles. Bioinformatics 33:1554–1560.
OpenUrl

[51] ↵
Paraskevopoulou MD, Georgakilas G, Kostoulas N, Vlachos IS, Vergoulis T, Reczko M, Filippidis C, Dalamagas T, Hatzigeorgiou AG. 2013. DIANA-microT web server v5.0: service integration into miRNA functional analysis workflows. Nucleic Acids Res. 41:W169–W173.
OpenUrl CrossRef PubMed Web of Science

[52] ↵
Pasquinelli AE, Reinhart BJ, Slack F, Martindale MQ, Kuroda MI, Maller B, Hayward DC, Ball EE, Degnan B, Müller P, et al. 2000. Conservation of the sequence and temporal expression of let-7 heterochronic regulatory RNA. Nature 408:86–89.
OpenUrl CrossRef PubMed Web of Science

[53] ↵
Pybus M, Dall’Olio GM, Luisi P, Uzkudun M, Carreño-Torres A, Pavlidis P, Laayouni H, Bertranpetit J, Engelken J. 2014. 1000 Genomes Selection Browser 1.0: a genome browser dedicated to signatures of natural selection in modern humans. Nucleic Acids Res. 42:D903–9.
OpenUrl CrossRef PubMed Web of Science

[54] ↵
R Development Core Team. 2009. {R: A language and environment for statistical computing}. Vienna, Austria: R Foundation for Statistical Computing

[55] ↵
Sarkar A, Nandineni MR. 2018. Association of common genetic variants with human skin color variation in Indian populations. Am. J. Hum. Biol. 30:e23068.
OpenUrl

[56] ↵
Saunders MA, Liang H, Li W-H. 2007. Human polymorphism at microRNAs and microRNA target sites. Proc. Natl. Acad. Sci. U. S. A. 104:3300–3305.
OpenUrl Abstract/FREE Full Text

[57] ↵
Savisaar R, Hurst LD. 2017. Both Maintenance and Avoidance of RNA-Binding Protein Interactions Constrain Coding Sequence Evolution. Mol. Biol. Evol. 34:1110–1126.
OpenUrl

[58] ↵
Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. 2001. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 29:308–311.
OpenUrl CrossRef PubMed Web of Science

[59] ↵
Sokal RR, Rohlf FJ. 1995. Biometry: The Principles and Practice of Statistics in Biological Research. Freeman

[60] ↵
Sood P, Krek A, Zavolan M, Macino G, Rajewsky N. 2006. Cell-type-specific signatures of microRNAs on target mRNA expression. Proc. Natl. Acad. Sci. U. S. A. 103:2746–2751.
OpenUrl Abstract/FREE Full Text

[61] ↵
Stark A, Brennecke J, Bushati N, Russell RB, Cohen SM. 2005. Animal MicroRNAs confer robustness to gene expression and have a significant impact on 3’UTR evolution. Cell 123:1133–1146.
OpenUrl CrossRef PubMed Web of Science

[62] ↵
Stark A, Brennecke J, Russell RB, Cohen SM. 2003. Identification of Drosophila MicroRNA targets. PLoS Biol. 1:E60.
OpenUrl CrossRef PubMed

[63] ↵
Stewart AJ, Hannenhalli S, Plotkin JB. 2012. Why transcription factor binding sites are ten nucleotides long. Genetics 192:973–985.
OpenUrl Abstract/FREE Full Text

[64] ↵
Umu SU, Poole AM, Dobson RC, Gardner PP. 2016. Avoidance of stochastic RNA interactions can be harnessed to control protein expression levels in bacteria and archaea. eLife [Internet]. Available from: https://elifesciences.org/articles/13479

[65] ↵
Zhou L, Lim MYT, Kaur P, Saj A, Bortolamiol-Becet D, Gopal V, Tolwinski N, Tucker-Kellogg G, Okamura K. 2018. Importance of miRNA stability and alternative primary miRNA isoforms in gene regulation during Drosophila development. eLife [Internet]. Available from: https://elifesciences.org/articles/38389