An ancient satellite repeat controls gene expression and embryonic development in Aedes aegypti through a highly conserved piRNA

Rebecca Halbach; Pascal Miesen; Joep Joosten; Ezgi Taşköprü; Bas Pennings; Chantal B.F. Vogels; Sarah H. Merkling; Constantianus J. Koenraadt; Louis Lambrechts; Ronald P. van Rij

doi:10.1101/2020.01.15.907428

Abstract

Tandem repeat elements such as the highly diverse class of satellite repeats occupy large parts of eukaryotic chromosomes. Most occur at (peri)centromeric and (sub)telomeric regions and have been implicated in chromosome organization, stabilization, and segregation¹. Others are located more dispersed throughout the genome, but their functions remained largely enigmatic. Satellite repeats in euchromatic regions were hypothesized to regulate gene expression in cis by modulation of the local heterochromatin, or in trans via repeat-derived transcripts^2,3. Yet, due to a lack of experimental models, gene regulatory potential of satellite repeats remains largely unexplored. Here we show that, in the vector mosquito Aedes aegypti, a satellite repeat promotes sequence-specific gene silencing via the expression of two abundant PIWI-interacting RNAs (piRNAs). Strikingly, whereas satellite repeats and piRNA sequences generally evolve extremely fast^4-6, this locus was conserved for approximately 200 million years, suggesting a central function in mosquito biology. Tandem repeat-derived piRNA production commenced shortly after egg-laying and inactivation of the most abundant of the two piRNAs in early embryos resulted in an arrest of embryonic development. Transcriptional profiling in these embryos revealed the failure to degrade maternally provided transcripts that are normally cleared during maternal-to-zygotic transition. Our results reveal a novel mechanism in which satellite repeats regulate global gene expression in trans via piRNA-mediated gene silencing, which is fundamental to embryonic development. These findings highlight the regulatory potential of this enigmatic class of repeats.

Main

Even though satellite repeats were discovered nearly 60 years ago^7,8 and comprise a substantial portion of eukaryotic genomes, little is known about the functions of this class of repetitive DNA. Many satellite repeats are actively transcribed, and some of them produce small interfering (si)RNAs required for the establishment and maintenance of heterochromatic regions^9-16. Around two-thirds of the genome of Aedes aegypti, the most important vector for arthropod-borne viruses like dengue, Zika, and yellow fever virus, consists of repetitive elements¹⁷ (Extended Data Fig 1A), making this mosquito an interesting model to study these sequences. We analyzed small RNAs derived from unique and repetitive sequences in the genome of Ae. aegypti somatic and germline tissues as well as Aag2 cells. Even though satellite repeats constitute less than 10% of the genome, they were highly covered not only by siRNAs (Extended Data Fig 1A), but especially by PIWI-interacting (pi)RNAs (Extended Data Fig 1A). piRNAs are a class of small RNAs that protect animal genomes from harmful parasitic elements like transposons¹⁸. In the fruit fly Drosophila melanogaster, piRNAs are mostly derived from transposon-rich genomic regions termed piRNA clusters¹⁹. Yet, in Ae. aegypti, piRNAs from transposable elements (TEs) are underrepresented compared to their abundance in the genome²⁰, especially in the soma (Extended Data Fig 1A); instead, we found satellite repeat-derived piRNAs to be highly overrepresented in somatic tissues. Intriguingly, approximately three-quarters of these reads in the soma, and half of the reads in the germline or Aag2 cells, represent only two individual sequences that map to a repeat locus on chromosome 3. This locus was about 3.5 kb in size and consisted of 20 full and one disrupted repeat unit organized in a head-to-tail array (Fig 1A, B).
These two highly abundant satellite-derived small RNAs were 30 and 29 nucleotides in size, respectively (Extended Data Fig 1B), and resistant to β-elimination, suggesting that they are 2’-O-methylated at their 3’ end, a common feature of mature PIWI-bound piRNAs^21-23 (Fig 1C). We named these two sequences tapiR1 and 2 (tandem repeat-associated piRNA1/2). Expression of tapiR1 and 2 was ubiquitous in both somatic and germline tissues of adult mosquitoes (Extended Data Fig 2A). In Ae. aegypti, the PIWI-interacting RNA pathway has expanded to include seven PIWI genes (Piwi2-7 and Ago3) compared to three in flies²⁴. Immunoprecipitation (IP) in Aag2 cells of the aedine PIWI proteins that are expressed both in the soma and gonads (Piwi4-6 and Ago3) followed by northern blotting or deep sequencing indicates that both tapiR1 and tapiR2 exclusively associate with Piwi4 (Fig 1D, Extended Data Fig 2B, C, Supplementary Fig S1A,B). Indeed, only knockdown of Piwi4, but not of other PIWI or AGO-clade genes, reduced tapiR1 and 2 levels (Extended Data Fig 2D, E, Supplementary Fig S1C,D). Thus far, the piRNA repertoire and function of Piwi4 remained unclear. Piwi4 neither associates with TE nor virus-derived piRNAs²⁵, yet was linked to piRNA biogenesis from transposons²⁵ and to antiviral defense^26,27. As nearly 90 % of Piwi4-associated small RNAs comprise only tapiR1 and, to a much lower extent, tapiR2 (Fig 1D), we hypothesize that tapiR1 not only dominates the piRNA repertoire, but also shapes downstream functions of Piwi4.
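
The read-level statements above (the share of satellite-derived reads contributed by two sequences, and their 30- and 29-nucleotide lengths) can be illustrated with a minimal sketch. The function names and the toy reads are hypothetical and are not taken from the analysis pipeline used in this study; the sketch only shows the arithmetic behind such summaries.

```python
from collections import Counter

def top_sequence_fraction(reads, k=2):
    """Fraction of all reads contributed by the k most abundant sequences."""
    if not reads:
        raise ValueError("no reads given")
    counts = Counter(reads)
    # most_common(k) returns (sequence, count) pairs, highest counts first
    return sum(n for _, n in counts.most_common(k)) / len(reads)

def length_distribution(reads):
    """Number of reads per read length, sorted by length."""
    return dict(sorted(Counter(len(r) for r in reads).items()))

# Toy input (made-up sequences, not real small RNA reads)
toy_reads = ["ACGU"] * 6 + ["GGCA"] * 3 + ["UUU"]
print(top_sequence_fraction(toy_reads))
print(length_distribution(toy_reads))
```

In a real library the input would be the reads that map to satellite repeats, so the resulting fraction is relative to that class rather than to all small RNAs.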

Extended Data Figure 1: Expression of piRNAs from a satellite repeat locus.

(A) Fraction of siRNAs and piRNAs mapping on genomic features in libraries derived from germline or somatic Ae. aegypti adult tissues. Small RNAs that overlapped multiple features were assigned to only one category (see Methods). Leftmost bar depicts the abundance of each feature category in the genome.

(B) Read length distribution of tapiR1 and 2 in libraries from Aag2 cells, and adult germline and somatic tissues (oxidized or untreated).

Extended Data Figure 2: tapiR1 and 2 are expressed in Ae. aegypti mosquitoes and associate with PIWI proteins.

(A, D, E) Northern blot of tapiR1 and 2 in different tissues of adult mosquitoes (A), upon dsRNA-mediated knockdown of individual PIWI genes (D), and upon knockdown of miRNA and siRNA pathway genes (E), or a control (dsFLuc and dsRLuc) in Aag2 cells. Ethidium bromide-stained rRNA, or U6 snRNA served as loading control.

(B) Western blot analysis of the indicated PIWI proteins before (input) and after immunoprecipitation (IP) used for the small RNA northern blot of panel C. An IP with empty beads serves as negative control. Tubulin was used to control for non-specific binding.

(C) Immunoprecipitation of the indicated PIWI proteins from Aag2 cells followed by northern blot analyses for tapiR1 and 2.

Figure 1: Conserved piRNAs are expressed from a satellite repeat and associate with Piwi4.

(A) Current annotation of the gene AAEL017385 and its splice variants (RA-RF) on chromosome 3, with the position of the satellite repeat locus and tapiR1/2 piRNAs indicated.

(B) Read coverage of the satellite repeat locus. Depicted are exons of AAEL017385 (blue), and small RNAs per million mapped miRNAs in Aag2 cells.

(C) Small RNA northern blot of tapiR1 and 2 upon β-elimination in Aag2 cells. miR-2940-3p serves as positive control for the treatment.

(D) Enrichment or depletion of tapiR1/2 compared to input sample in the indicated PIWI-IP small RNA sequencing libraries (left panel), and fraction of tapiR1/2 on total reads enriched in Piwi4 (right panel).

(E) Sequence conservation of the satellite repeat monomers. All individual repeat monomers from Ae. aegypti, Ae. albopictus and Cx. quinquefasciatus were used to generate the sequence logo. Boxes highlight the position of tapiR1 and 2 in the monomer.

(F) Northern blot analysis of tapiR1/2 in the indicated mosquito species (genera Aedes, Culex, Culiseta, Coquillettidia, and Anopheles) and other insects (Culicoides and Drosophila). Ethidium bromide-stained rRNA serves as loading control. For comparison, Ae. aegypti was included twice. A schematic representation of the phylogenetic relationships is shown in the bottom panel. Bar lengths are arbitrary and do not reflect evolutionary distances.

Figure 2: tapiR1 silences target RNAs in trans through seed-mediated base pairing.

(A) Schematic representation of the firefly luciferase (FLuc) reporter constructs (left panel) and luciferase assay in Aag2 cells transfected with reporters containing no target site (empty), a fully complementary target site to tapiR1, or a control target site.

(B) Luciferase assay of reporters with tapiR1 target sites, mismatched sites (mm4), or control sequences located at different positions in the reporter mRNA.

(C, D) Luciferase assay of tapiR1 reporters harbouring three consecutive mismatches (C), or increasing number of mismatches (D) at the indicated positions of the piRNA target site in the 3’ UTR of firefly luciferase. Firefly luciferase activity was normalized to the activity of a co-transfected Renilla luciferase reporter. Indicated are mean, standard deviation, and individual measurements of a representative experiment performed with two to three independent clones per construct and measured in triplicates.

(E) log2 expression of mRNAs and lncRNAs in Aag2 cells upon treatment with a tapiR1 specific or control antisense oligonucleotide (AO). Depicted are average read counts in three biological replicates. A pseudo-count of one was added to all values in order to plot values of zero. Diagonal lines indicate a fold change of two. Significance was tested at a false discovery rate (FDR) of 0.01 and a log2 fold change of 0.5 as indicated by coloured dots.

(F) RT-qPCR of tapiR1 target genes upon transfection of Aag2 cells with tapiR1 specific or control AO. Depicted are mean, standard deviation, and individual measurements of one experiment measured in technical duplicates.

Satellite repeats are one of the fastest evolving parts of eukaryotic genomes. Except for a few examples^28-30, most satellite repeats display high sequence divergence between species, and can even be species-specific^6,31,32, akin to piRNAs^4,5. Hence, we were surprised to find that the identified tandem repeat locus in Ae. aegypti is conserved in the closely related Asian tiger mosquito Ae. albopictus, and even in the more distantly related southern house mosquito Culex quinquefasciatus (Extended Data Fig 3A). This locus is, however, not present in the genome assembly of the malaria vector Anopheles gambiae. The tandem repeat locus differed in the number of monomers across species, and the monomers exhibited substantial length and sequence divergence, both between species and between monomers within one species. However, the parts of the monomer that give rise to tapiR1 and 2 were far more conserved than the overall monomer, suggesting that these sequences are under extensive selective constraints (Fig 1E). We further analyzed whether expression of the two repeat locus-derived piRNAs is conserved among different mosquitoes, including species for which no genome assembly is available. We analyzed 17 different mosquito species from 5 different genera (Aedes, Culex, Culiseta, Coquillettidia and Anopheles), as well as Culicoides nubeculosus, a biting midge that also transmits arboviruses, but is only distantly related to mosquitoes, and the fruit fly Drosophila melanogaster. Strikingly, even though piRNAs are usually not conserved even between closely related species^4,5, we detected both tapiR1 and 2 in four genera of the Culicinae subfamily of mosquitoes (Fig 1F, Extended Data Fig 3B). In line with the absence of the repeat locus in the Anopheles gambiae genome, we did not observe tapiR1 or 2 expression in Anopheles mosquitoes, nor in the two non-mosquito species.
This observation suggests that the locus evolved in the late Triassic after divergence of the Anophelinae from the Culicinae subfamily of mosquitoes (229-192 mya³³), but before further divergence of the culicine genera (226-172 mya³³). This establishes this repeat locus as one of the very few ancient and deeply conserved satellite repeats that have hitherto been described^28-30,34. Conservation of the locus over millions of years of mosquito evolution strongly suggests important and conserved functions for the locus and its associated piRNAs.

Extended Data Figure 3: Expression of tapiR1 is independent of AAEL017385.

(A) Schematic representation of the tapiR1/2 satellite repeat locus in Ae. aegypti, Ae. albopictus and Cx. quinquefasciatus. Numbers indicate lengths of the repeats, and, for Cx. quinquefasciatus, also length of deviating repeat monomers.

(B) Evolutionary relationships of dipterous genera based on ref³³. Bar lengths are arbitrary and do not reflect evolutionary distances.

(C) Northern blot of tapiR1 in Aag2 cells treated with dsRNA targeting different transcripts of AAEL017385, or, as control, firefly luciferase (FLuc). Ethidium bromide-stained rRNA serves as loading control.

(D) Top panel: Schematic representation of the AAEL017385 locus and satellite repeat. The primer used for 3’ RACE and positions targeted by dsRNA in panel C are indicated with an arrow and wavy lines, respectively.

Bottom panel: 3’ RACE analysis of AAEL017385. Indicated are sequences from the current AaegL5 genome annotation and RACE PCR products. The sequences of the 5’ terminal tapiR1 and 2 repeats are highlighted with colours.

The satellite repeat locus overlaps with the 3’UTR of the gene AAEL017385 (LOC23687805) of unknown function in the current genome annotation (Fig 1A). This organization suggests that one or more splice variants of AAEL017385 are the source of the piRNAs, and that expression of this gene and piRNA biogenesis might be closely linked. However, knockdown of the different splice isoforms did not reduce expression of tapiR1 (Extended Data Fig 3C, Supplementary Fig S1E), arguing against this possibility. Rapid amplification of 3’ ends (RACE) of AAEL017385 transcripts revealed transcription termination sites directly upstream of the first tapiR1 and tapiR2 repeat, respectively (Extended Data Fig 3D). Even though we cannot exclude that some AAEL017385 transcripts overlap with the satellite repeat, our data strongly suggest that the two piRNAs and AAEL017385 are not expressed from the same transcriptional unit. Instead, the satellite repeat locus might be transcribed from an unknown upstream or internal promoter. In support of this notion, the repeat locus is not associated with the AAEL017385 orthologues in Ae. albopictus (AALF011179) and Cx. quinquefasciatus (CPIJ011773).

We next characterized the sequence requirements for target silencing by tapiR1, as its expression is approximately one log higher compared to tapiR2 in Aag2 cells. Using a luciferase reporter harbouring a fully complementary target site in the 3’ UTR, we validated that this piRNA is able to target RNAs in trans. The reporter was silenced more than 10-fold or 35-fold compared to a reporter without target site, or a control reporter with a partially inverted target site, respectively (Fig 2A). Addition of an antisense oligonucleotide (AO) complementary to tapiR1 relieved silencing in a concentration-dependent manner (Extended Data Fig 4), confirming that the observed effect is mediated by the piRNA in a sequence-specific fashion. Unlike most miRNAs³⁵, silencing was not dependent on the position of the target site in the mRNA and was efficient in the open reading frame and both the 5’ and the 3’ UTR (Fig 2B). During the course of our study we noticed that the firefly and Renilla luciferase genes, which we used as reporter and normalization control, respectively, contain potential target sites for tapiR1. We confirmed that Renilla luciferase is indeed potently suppressed by tapiR1, and firefly luciferase slightly so (Extended Data Fig 5A-C); yet, mutating these target sites did not affect any of the conclusions reported below, but increased the observed effects (Extended Data Fig 5D,E).
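
The reporter read-out described above rests on simple arithmetic: firefly luciferase activity is divided by the co-transfected Renilla luciferase activity, and fold silencing is the ratio of the normalized reference reporter to the normalized target-site reporter. The following minimal sketch illustrates this with made-up numbers; the function names are hypothetical and the values are not measurements from this study.

```python
from statistics import mean

def normalized_activity(fluc, rluc):
    """Per-well firefly/Renilla ratio (controls for transfection efficiency)."""
    if len(fluc) != len(rluc):
        raise ValueError("need one Renilla value per firefly value")
    return [f / r for f, r in zip(fluc, rluc)]

def fold_silencing(target_ratios, reference_ratios):
    """How many fold lower the target-site reporter is than the reference."""
    return mean(reference_ratios) / mean(target_ratios)

# Made-up triplicate readings (arbitrary light units)
reference = normalized_activity([1000, 1100, 900], [1000, 1000, 1000])
target = normalized_activity([100, 110, 90], [1000, 1000, 1000])
print(round(fold_silencing(target, reference), 2))
```

Because this normalization assumes that the Renilla signal is itself unaffected by tapiR1, a Renilla reporter that carries a functional target site would distort the ratio, which is why the mutated normalization control matters.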

Extended Data Figure 4: An antisense oligonucleotide relieves tapiR1-mediated silencing.

(A) Luciferase assay of a reporter with a fully complementary target site for tapiR1 in the 3’ UTR. Cells were co-transfected with the reporter and increasing amounts of a fully 2’O-methylated antisense RNA oligonucleotide (AO), or a control AO. Firefly luciferase activity was normalized to the activity of a co-transfected Renilla luciferase reporter. Indicated are mean, standard deviation and individual measurements from a representative experiment measured in triplicate.

(B) Northern blot detection of tapiR1 in Aag2 cells upon treatment with tapiR1 or control AO. Cells were harvested after the indicated time points. Ethidium bromide-stained rRNA serves as loading control.

Extended Data Figure 5: Renilla luciferase contains a functional tapiR1 target site.

(A) Schematic representation of predicted tapiR1 target sites and minimum free energy of the indicated structures in the coding sequences of Renilla luciferase (RLuc) or firefly luciferase (FLuc).

(B) Luciferase assay of Aag2 cells transfected with reporters carrying either a scrambled (scr) site or the predicted target site from firefly luciferase (left panel), or from Renilla luciferase (right panel) from panel (A) in the 3’UTR of FLuc.

(C) Luciferase activity of firefly luciferase or Renilla luciferase construct with synonymous mutations introduced into the predicted tapiR1 target site (ΔtapiR1 site) or the parental clones.

(D) Luciferase reporter assay of reporters carrying target sites for tapiR1 as indicated in panel A in the 3’UTR of either the parental firefly luciferase, or the ΔtapiR1 firefly luciferase version.

(E) Reporter assay with luciferase carrying tapiR1 target sites with single mismatches in the 3’ UTR as used in Extended Data Fig 6B, using RLuc with a mutated tapiR1 target site (ΔtapiR1 site) for normalization. Left panel is a zoom to the x-axis of the right panel. Shown are mean, standard deviation and individual measurements from representative experiments performed with at least two different clones per construct, and measured in triplicate.

To assess targeting requirements for tapiR1, we introduced mismatches in the piRNA target sites. Three consecutive mismatches were tolerated unless they were located in the t1 to t9 region of the piRNA (the nucleotides base-paired to piRNA positions 1 to 9) (Fig 2C, Extended Data Fig 6A), and single mismatches only impaired silencing at positions t3 to t7 (Extended Data Fig 6A,B), reminiscent of a microRNA seed³⁵ and comparable to what has been termed the piRNA seed in C. elegans^36,37. Even though tapiR1 targeting requirements resemble those of miRNAs, the results are unlikely to be due to the piRNA being funnelled into the miRNA pathway. First, tapiR1 biogenesis is not dependent on Ago1 (Extended Data Fig 2E), and second, this piRNA is 2’-O-methylated (Fig 1C), a feature of siRNAs and mature PIWI-bound piRNAs, but not Ago1-associated miRNAs. A mismatch at position t1 did not alter silencing, suggesting that the first nucleotide is anchored in a binding pocket of Piwi4, similar to other Argonaute proteins^38-40. Unexpectedly, a mismatch at position t2 was tolerated as well. This was, however, only the case when the rest of the target site was perfectly complementary, but not when the target contained mismatches outside of the seed (Extended Data Fig 6C). We further noticed that, in contrast to C. elegans³⁶, G:U wobble pairs were not tolerated inside the seed and had the same effect as a mismatch at the same position (Extended Data Fig 6D). Whereas the 5’ seed region is normally absolutely required for targeting^36-38,41,42, the 3’ part of the piRNA might increase specificity and efficiency of the targeting. For this reason we further assessed the extent of complementarity needed to allow for tapiR1-mediated silencing.
Introduction of increasing numbers of mismatches at the 3’ end did not interfere with silencing when at least half of the piRNA could base pair with the target site (Fig 2D), indicating that the 3’ part of the piRNA is not necessarily required, yet the seed region alone is not sufficient for silencing.
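
The targeting rules inferred from these reporter assays can be summarized as a rough decision rule. The sketch below is an illustration under simplifying assumptions, not the method used in this study: it requires Watson-Crick pairing at positions t3 to t7 (G:U wobbles count as mismatches) and pairing of at least half of the piRNA. It does not model the context dependence observed at position t2, and it counts paired positions anywhere rather than only from the 5’ end. The 30-nt sequence and all function names are hypothetical; the sequence is not tapiR1.

```python
# Watson-Crick partners only; G:U wobbles are deliberately treated as mismatches.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def perfect_site(pirna):
    """Fully complementary target site (reverse complement), written 5'->3'."""
    return "".join(COMPLEMENT[b] for b in reversed(pirna))

def with_mismatches(pirna, positions):
    """Perfect site with mismatches opposite the given 1-based piRNA positions."""
    site = list(perfect_site(pirna))
    for pos in positions:
        # The target base facing piRNA position pos sits at index len - pos.
        # Copying the piRNA base there always creates a non-pairing base.
        site[len(pirna) - pos] = pirna[pos - 1]
    return "".join(site)

def paired_positions(pirna, site):
    """Per piRNA position (5'->3'): True if Watson-Crick paired with the site."""
    return [COMPLEMENT[p] == t for p, t in zip(pirna, site[::-1])]

def is_candidate_target(pirna, site, seed=(3, 7), min_fraction=0.5):
    """Assumed rule: intact seed t3-t7 and at least half of the piRNA paired."""
    pairs = paired_positions(pirna, site)
    seed_ok = all(pairs[seed[0] - 1:seed[1]])
    return seed_ok and sum(pairs) >= min_fraction * len(pirna)

# Made-up 30-nt piRNA used purely for illustration
PIRNA = "UAGCUAGGCUAACGGAUCCGAUUGCAAGCU"
print(is_candidate_target(PIRNA, perfect_site(PIRNA)))          # True
print(is_candidate_target(PIRNA, with_mismatches(PIRNA, [1])))  # True
print(is_candidate_target(PIRNA, with_mismatches(PIRNA, [5])))  # False
```

A rule of this kind is permissive by design, which fits the observation that sequence complementarity alone does not fully define functional targets.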

Extended Data Figure 6: tapiR1 uses a G:U wobble sensitive seed sequence for target recognition.

(A) Schematic representation of the reporter constructs used in panel B and Figure 2. Numbers indicate the position of the mismatch relative to the 5’ end of the piRNA.

(B) Luciferase assay of reporters carrying a tapiR1 target site with single mismatches in the 3’ UTR.

(C) Luciferase activity of reporters with the tapiR1 target site from RLuc and indicated mismatches in the 3’ UTR of FLuc (left panel). The tapiR1 target duplexes and mutants are presented in the right panel.

(D) Luciferase activity of tapiR1 reporters carrying mismatches or G:U wobble base pairs at the indicated positions. Firefly luciferase activity was normalized to the activity of a co-transfected Renilla luciferase reporter to control for differences in transfection efficiencies. Data represent mean, standard deviation and individual measurements of representative experiments with two independent clones per construct and measured in triplicates.

Taken together, these results suggest that tapiR1 needs relatively low sequence complementarity to efficiently silence targets^36,41-43, and that there are no constraints regarding the position of the target site on the mRNA. As a consequence, tapiR1 may target a plethora of different cellular RNAs that base pair perfectly with the seed and have additional matches outside the seed.

Considering the fact that the repeat locus is extremely conserved, we hypothesized that this piRNA regulates cellular gene expression. Some satellite repeats can influence genes by modulation of the local chromatin environment in cis ^14,44, or have been hypothesized to induce silencing of genes with homologous repeat insertions²⁹. In contrast, the tapiR1/2 locus has the potential to silence expression of a broad range of remote genes in trans, independent of repeat insertions in target genes, and thus, to regulate diverse and highly complex cellular processes.

To test this idea, we blocked tapiR1-mediated silencing with the tapiR1 AO in Aag2 cells and assessed global gene expression by RNAseq two days after treatment. Intriguingly, expression of 134 genes, amongst which many long non-coding RNAs, was significantly increased up to around 450 fold compared to the treatment with a control oligonucleotide (Fig 2E, Supplementary Table 1). Transposons were not globally affected, although some elements were up-regulated as well, up to around 850-fold (Extended Data Fig 7A, Supplementary Table 2). Expression of deregulated genes, and a transposable element was increased in a concentration-dependent manner upon tapiR1 AO treatment as measured by RT-qPCR (Fig 2F), validating our RNAseq results. We then used RNAHybrid to predict tapiR1 target sites and verified these sequences in luciferase reporter assays. Twelve out of 23 sites in protein-coding genes, lncRNAs, or transposable elements were indeed sufficient to support suppression of the reporter (Extended Data Fig 8), strongly suggesting that tapiR1 directly represses these cellular RNAs, and confirming that tapiR1 only needs limited base pairing to mediate silencing. Computational prediction of target sites has inherent limitations, and not all genes with predicted target sites were differentially regulated upon AO treatment, or, vice versa, target sites were functional in a reporter context, yet the gene itself was not differentially expressed (Extended Data Fig 7B,C). Additionally, the minimum free energy of the piRNA/target duplex was not predictive for the effect size of tapiR1-mediated silencing. Thus, similar to miRNAs⁴⁵, other factors beyond Watson-Crick base pairing seem to play a role in definition of bona fide target genes for tapiR1. Nevertheless, our results indicate that tapiR1 is able to directly and strongly regulate gene expression and transposon RNA levels in a sequence-dependent manner. 
The expression of satellite repeats is often regulated in a developmental or stage-specific manner^29,46; we therefore analysed the expression pattern of tapiR1 and 2 throughout the mosquito life cycle. tapiR1 and 2 were not expressed during the first three hours of embryonic development of Ae. aegypti (Fig 3A), but could be detected in all subsequent life stages (Extended Data Fig 9A,B). At the very beginning of embryonic development the zygotic genome is transcriptionally quiescent and the first mitotic divisions are exclusively driven by maternally deposited transcripts and proteins. Maternal-to-zygotic transition (MZT) is marked by the degradation of these maternal transcripts and concomitant zygotic genome activation⁴⁷. Destabilization of maternal transcripts initially occurs through maternal decay activities, and later, after onset of zygotic transcription, also by zygotic components^47,48. One of the best described mechanisms involves zygotically expressed miRNAs, for example miR-430, miR-427, and the miR-309 cluster in zebrafish⁴⁹, Xenopus⁵⁰, and flies⁵¹, respectively. Based on its expression pattern and its strong suppressive ability we hypothesized that tapiR1 could be part of the zygotic degradation pathway in mosquitoes, and necessary for embryonic development. To test this idea we injected either a tapiR1-specific AO or control AO into early pre-blastoderm Ae. aegypti embryos (before zygotic genome activation and expression of tapiR1) (Fig 3B), and assessed their development using discrete scoring schemes (Supplementary Fig S1F). Strikingly, more than 90 % of all tapiR1 AO-injected embryos were arrested early in development, whereas about half of all control embryos showed obvious signs of developmental progression (Fig 3C). In accordance, only a small fraction of tapiR1 AO-injected embryos hatched (Fig 3D) and continued to develop as larvae, suggesting that tapiR1-deficiency impedes development.
RNA sequencing from tapiR1 AO or control AO-injected embryos 20.5 h after injection revealed massive deregulation of cellular transcripts (Fig 3E, Extended Data Fig 9C, Supplementary Table 3, 4). Expression of 205 genes, among which 44 lncRNAs, as well as a few transposable elements was increased up to around 1000 and 500 fold in tapiR1 AO-treated embryos, respectively. Target genes with predicted tapiR1 target sites were more strongly up-regulated than genes without a predicted site, as indicated by a shift of the cumulative distribution of RNA fold changes (Fig 3F). These findings show that tapiR1 controls regulatory circuits by direct gene targeting also in vivo, and that this function is essential for embryonic development, likely by promoting mRNA turnover of a subset of maternal transcripts. In line with this conclusion, confirmed target genes are down-regulated after the onset of tapiR1 expression (Fig 3G), and tapiR1 targets are overrepresented in transcripts that are maternally provided and degraded during MZT (Fig 3H).
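
The comparison of fold changes between genes with and without predicted target sites can be sketched as follows. This is a simplified illustration with made-up counts, not the differential expression analysis used in this study: it adds a pseudo-count of one before taking log2 ratios (as in the scatter plots) and builds an empirical cumulative distribution, but it performs no statistical test and no FDR correction. All function names are hypothetical.

```python
import math

def log2_fold_change(treated_mean, control_mean, pseudo=1.0):
    """log2 ratio of mean counts; the pseudo-count keeps zero counts finite."""
    return math.log2((treated_mean + pseudo) / (control_mean + pseudo))

def ecdf(values):
    """Empirical cumulative distribution as (value, cumulative fraction) pairs."""
    xs = sorted(values)
    return [(x, (i + 1) / len(xs)) for i, x in enumerate(xs)]

def fraction_at_or_below(values, threshold):
    """Share of genes whose log2 fold change does not exceed the threshold."""
    return sum(v <= threshold for v in values) / len(values)

# Made-up log2 fold changes for two gene sets
with_site = [0.1, 2.0, 3.0, -0.5]
without_site = [0.0, 0.2, -0.3, 0.4]
print(log2_fold_change(7, 0))
print(fraction_at_or_below(with_site, 0.5), fraction_at_or_below(without_site, 0.5))
```

A smaller fraction at or below a given threshold for the set with predicted sites corresponds to a rightward shift of its cumulative distribution, which is the pattern described for Fig 3F.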

Extended Data Figure 7: tapiR1 silences gene expression in Aag2 cells.

(A) log2 mRNA expression of transposable elements in Aag2 cells treated with a tapiR1 specific antisense oligonucleotide (AO) or control AO. Depicted are the means of three biological replicates. A pseudo-count of one was added to all values in order to plot values of zero. Diagonal lines represent a fold change of two. Significance was tested at an FDR of 0.01 and a log2 fold change of 0.5.

(B) log2 fold changes of genes upon treatment with tapiR1 or control AO in Aag2 cells (left) and mosquito embryos (right) plotted against the minimum free energy of predicted tapiR1-target duplexes. Blue dots indicate target sites that were confirmed to be functional, and red dots indicate target sites that were not functional in luciferase reporter assays (see Extended Data Fig 8).

(C) Violin plot of log2 fold changes of all genes in Aag2 cells (left) and mosquito embryos (right), either with or without predicted tapiR1 target site.

Extended Data Figure 8: Validation of tapiR1 target genes.

(A) Predicted structures and minimum free energy of tapiR1/target duplexes analysed in panel B.

(B) Luciferase assay of reporters carrying the predicted target site from panel A in the 3’ UTR of firefly luciferase. Firefly luciferase activity was normalized to the activity of a co-transfected Renilla luciferase reporter to control for differences in transfection efficiencies. Indicated are mean, standard deviation and individual measurements from representative experiments performed with two to three independent clones per construct and measured in triplicates.

(C) AAEL017422, AAEL001741, and AAEL000453 were annotated in the previous AaegL3 gene set, but not in the current AaegL5 gene set. Read coverage in tapiR1 AO and control AO treated Aag2 cells at these genomic regions suggests that these regions are actively transcribed, but repressed by tapiR1. Red boxes indicate the positions of tapiR1 target sites.

Extended Data Figure 9: tapiR1 regulates gene expression in mosquito embryos.

(A, B) Northern blot analysis of tapiR1 and 2 in developmental stages of Ae. aegypti mosquitoes (A), or at different time points after blood feeding (B). U6 snRNA (A) or ethidium bromide-stained rRNA (B) were analyzed to verify equal loading.

(C) log2 mRNA expression of transposable elements in embryos injected with tapiR1-specific or control AO. Mean counts of five biological replicates are shown. Significance was tested at an FDR of 0.01 and a log2 fold change of 0.5. Diagonal lines indicate a fold change of two.

Supplementary Data Figure S1: Antibody validation, uncropped Western blot images, knockdown efficiencies, and scoring scheme for the development of Ae. aegypti embryos.

(A) Validation of Ae. aegypti PIWI antibodies. Specificity was confirmed by detection of an additional band in PTH-tagged PIWI-expressing Aag2 cells, and loss of signal upon dsRNA-mediated knockdown. Knockdown with dsRNA targeting RLuc (dsRLuc) serves as negative control.

(B) Uncropped Western blot images corresponding to Extended Data Fig 2B.

(C-E) Knockdown efficiencies of PIWI genes shown in Extended Data Fig 2D (C), siRNA and miRNA pathway genes shown in Extended Data Fig 2E (D), and AAEL017385 isoforms in the experiment shown in Extended Data Fig 3C (E).

(F) Representative images of embryos scored as either undeveloped, intermediate or fully developed at 2.5 days post injection with antisense RNA oligonucleotides.

Figure 3: tapiR1 is essential for embryonic development in vivo by promoting turnover of maternally deposited transcripts.

(A) Expression of tapiR1 and 2 as analysed by northern blot in Ae. aegypti embryos. Time indicates the age of the embryos after a 30 min egg laying period. For each time point, around 50 to 150 eggs were pooled.

(B) Outline of the experimental procedure.

(C) Percent of embryos injected with either tapiR1 or control AO that reached the indicated developmental stages 2.5 days post injection. Individual embryos were scored as either undeveloped, intermediate or fully developed, as shown in Supplementary Figure S1F.

(D) Percent of embryos injected with tapiR1 or control AO that hatched four days post injection. Box-whiskers plot represents mean, first and third quartile and maximum and minimum of the data. Points show the individual experiments with 20 to 60 (C), or 50 to 150 (D) embryos per group.

(E) log2 expression of genes in embryos injected with tapiR1 specific or a control AO at 20.5 h post injection. Mean counts from five biological replicates plus a pseudo-count of one are plotted. Per replicate, 50 embryos per group were pooled. Significance was tested at an FDR of 0.01 and a log2 fold change of 0.5. Diagonal lines highlight a fold change of two.

(F) Experimental cumulative distribution of log2 fold changes of genes without or with predicted target sites for tapiR1. Target sites were grouped based on the predicted minimum free energy of the piRNA/target duplex.

(G) Expression of tapiR1 target genes in Ae. aegypti embryos. RT-qPCR was performed on samples shown in (A). Abd-A is a gene not targeted by tapiR1 and serves as negative control.

(H) Fraction of genes in different classes of genes expressed in embryos between 0 and 16 h post egg-laying.

piRNAs have been shown to promote degradation of nanos⁵² and other transcripts involved in germ cell development⁵³. Yet, this was dependent on transposon-derived piRNAs, and rather depicts a re-purposing of the existing piRNA pool, which is, due to its large targeting potential, ideal to be used to degrade a large number of transcripts. In contrast, we propose that, analogous to abundant miRNAs in other animals^49-51, Culicinae mosquitoes have evolved a specific piRNA to destabilize a defined set of maternally deposited transcripts in early embryonic development. To our knowledge, this is the first example of sequence-specific gene silencing by transcriptional products from a satellite repeat in trans, and underlines the regulatory potential of tandemly repeated DNA.

Methods

Cell culture

Aedes aegypti Aag2 cells were cultured in Leibovitz’s L-15 medium (Invitrogen) supplemented with 10 % heat-inactivated Fetal Bovine Serum (PAA Laboratories), 2 % Tryptose Phosphate Broth Solution (Sigma Aldrich), 1x MEM Non-Essential Amino Acids (Invitrogen) and 50 U/ml penicillin/streptomycin (Invitrogen) at 25 °C.

Mosquito rearing

Injections and northern blots of embryos were performed using a Cell fusion agent virus-free, isofemale Aedes aegypti strain called Jane. This strain was initiated from a field population originally sampled in the Muang District of Kamphaeng Phet Province, Thailand⁵⁴, and reared for 26 generations at 28 ± 1 °C, 75 ± 5 % relative humidity, and a 12:12 hour light-dark cycle. Embryos were hatched under low pressure for 30-60 min. Larvae were grown in dechlorinated tap water and fed fish food powder (Tetramin) every two days. Adults were maintained in cages with constant access to a 10 % sucrose solution. Female mosquitoes were fed on commercial rabbit blood (BCL) through a membrane feeding system (Hemotek Ltd.) using pig intestine as membrane. For AO injections, female mosquitoes were transferred to 25 °C and 70 % humidity for at least two days before being forced to lay eggs, and embryos were returned to 28 °C immediately after injection. For the time-course experiment in Fig. 3A,G, embryos were kept at 25 °C during the course of the experiment.

All other in vivo experiments were performed with the Ae. aegypti Rockefeller strain, obtained from Bayer AG, Monheim, Germany. The mosquitoes were maintained at 27±°C with 12:12 hour light:dark cycle and 70% relative humidity, as described before⁵⁵.

Mosquitoes used in Fig 1F were either laboratory-reared or wild-caught. Aedes aegypti Liverpool strain, Culex pipiens, Anopheles coluzzii, An. quadriannulatus, and An. stephensi mosquitoes, Culicoides nubeculosus biting midges, and D. melanogaster w¹¹¹⁸ flies were laboratory strains. The mosquitoes were deep-frozen and stored at −80 °C until use. Ae. albopictus, Ae. cantans, Ae. intrudens, Ae. pullatus, Ae. cinereus, Ae. vexans, Cx. pusillus, Culiseta morsitans, Coquillettidia richiardii, An. maculipennis, An. claviger, and An. coluzzii were wild-caught individuals collected in different regions in Italy, Sweden, or the Netherlands between July 2014 and June 2015⁵⁶. Specimens were identified to species level and stored at −20 °C for a maximum of two years.

Gene knockdown

Double-stranded RNA was generated by in vitro transcription of T7 promoter-flanked PCR products with T7 RNA polymerase. Primer sequences are given in Supplementary Table S5. The reaction was carried out at 37 °C for 3 to 4 h, then heated to 80 °C for 10 min and gradually cooled down to room temperature to facilitate dsRNA formation. The dsRNA was purified with the GenElute Total RNA Miniprep Kit (Sigma Aldrich).

Aag2 cells were seeded in 24-well plates the day before the experiment, and transfected with X-tremeGENE HP Transfection reagent (Roche) according to the manufacturer’s instructions, using a ratio of 4 μL reagent per μg of dsRNA. The transfection medium was replaced after 3 h with fully supplemented Leibovitz’s L-15 medium and RNA was harvested 48 h later. Knockdown was confirmed by RT-qPCR.

RNA isolation

RNA from cells and mosquitoes was isolated with Isol-RNA lysis buffer (5PRIME) according to the manufacturer’s instructions. Briefly, 200 μL chloroform was added to 1 mL lysis buffer, and the mixture was centrifuged at 16,060 x g for 20 min at 4 °C. Isopropanol was added to the aqueous phase, followed by incubation on ice for at least one hour, and centrifugation at 16,060 x g for 10 min at 4 °C. The pellet was washed three to five times with 85 % ethanol and dissolved in RNase free water. RNA was quantified on a NanoDrop spectrophotometer.

Periodate treatment and β-elimination

Total RNA was treated with 25 mM NaIO₄ in a final concentration of 60 mM borax and 60 mM boric acid (pH 8.6) for 30 min at room temperature. In the control, NaIO₄ was replaced by an equal volume of water. The reaction was quenched with glycerol and β-elimination was induced with a final concentration of 40 mM NaCl for 90 min at 45 °C. RNA was ethanol precipitated and analysed with northern blot.

Generation of antibodies

Custom-made antibodies (Eurogentec) against endogenous PIWI proteins were generated by immunization of two rabbits per antibody with a mix of two unique peptides (Ago3: TSGADSSESDDKQSS, IIYKRKQRMSENIQF; Piwi4: HEGRGSPSSRPAYSS, HHRESSAGGRERSGN; Piwi5: DIVRSRPLDSKVVKQ, CANQGGNWRDNYKRAI; Piwi6: MADNPQEGSSGGRIR, RGDHRQKPYDRPEQS). After 87 days and a total of four immunizations (t=0, 14, 28, 56 days), sera of both rabbits were collected, pooled, and purified against each peptide separately. Specificity of the antibody was confirmed by Western blotting of Aag2 cells stably expressing PTH (ProteinA, TEV cleavage site, 6x His-tag)-tagged PIWI⁵⁷ upon knockdown of the respective PIWI protein, or a control knockdown (dsRLuc) (see Supplementary Fig S1A).

Immunoprecipitation and western blotting

Aag2 cells were lysed with RIPA buffer (10 mM Tris-HCl, 150 mM NaCl, 0.5 mM EDTA, 0.1 % SDS, 1 % Triton-X-100, 10 % DOC, 1x protease inhibitor cocktail), supplemented with 10 % glycerol and stored at −80 °C until use. The IP was performed with custom-made antibodies against Piwi4-6 and Ago3 (1:10 dilution) at 4 °C for 4 h on rotation. Protein A/G Plus beads (Santa Cruz) were added at a dilution of 1:10 and then incubated overnight at 4 °C on rotation. Beads were washed three times with RIPA buffer; one half was used for RNA isolation and the other half for protein analysis. For RNA extraction, beads were treated with proteinase K for 2 h at 55 °C and RNA was isolated by phenol-chloroform extraction. Equal amounts of RNA for input and IPs were then analysed by northern blotting. For western blotting, the IP samples were boiled in 2x Laemmli buffer for 10 min at 95 °C, separated on 7.5 % SDS-polyacrylamide gels, and blotted on 0.2 μm nitrocellulose membranes (Bio-Rad) in a wet blot chamber on ice. Membranes were blocked for 1 h with 5 % milk in PBS-T (137 mM NaCl, 12 mM phosphate, 2.7 mM KCl, pH 7.4, 0.1 % (v/v) Tween 20) and incubated with PIWI-specific (dilution 1:1000) and Tubulin primary antibodies (rat anti-Tubulin alpha, MCA78G, 1:1000, Sanbio) overnight at 4 °C. The next day, membranes were washed three times with PBS-T and incubated with secondary antibodies conjugated to a fluorescent dye (IRDye 800CW conjugated goat anti rabbit, 1:10,000, Li-Cor, and IRDye 680LT conjugated goat anti rat, 1:10,000, Li-Cor) for 1 h at room temperature in the dark. After washing three times in PBS-T, signal was detected with the Odyssey-CLx Imaging system (Li-Cor).

Northern blot

piRNAs were detected by northern blot analyses, as published in ref.⁵⁸. Briefly, RNA was denatured at 80 °C for 2 min in Gel Loading Buffer II (Ambion) and size-separated on 0.5 x TBE (45 mM Tris-borate, 1 mM EDTA), 7 M Urea, 15 % denaturing polyacrylamide gels. RNA was then blotted on Hybond-NX nylon membranes (GE Healthcare) in a semi-dry blotting chamber for 45 min at 20 V and 4 °C and crosslinked to the membrane with EDC crosslinking solution (127 mM 1-methylimidazole (Sigma-Aldrich), 163 mM N-(3-dimethylaminopropyl)-N'-ethylcarbodiimide hydrochloride (Sigma-Aldrich), pH 8.0) at 60 °C for two hours. Crosslinked membranes were pre-hybridized in ULTRAHyb-Oligo hybridization buffer (Thermo Scientific) for one hour at 42 °C and probed with the indicated ³²P 5’ end-labelled DNA oligonucleotide probes overnight at 42 °C. Membranes were then washed with decreasing concentrations of SSC (300 mM NaCl, 30 mM sodium citrate pH 7.0; 150 mM NaCl, 15 mM sodium citrate; 15 mM NaCl, 1.5 mM sodium citrate) and 0.1 % SDS, and exposed to Carestream BioMax XAR X-Ray films (Kodak). Probe sequences can be found in Supplementary Table S5.

Reporter cloning and luciferase assay

Reporters were constructed by cloning annealed and phosphorylated oligonucleotides with the indicated tapiR1 or control target sites in the pMT-GL3 vector⁵⁹. This vector encodes the Photinus pyralis firefly luciferase (GL3) under a copper-inducible metallothionein promoter. Sense and antisense oligonucleotides (Sigma Aldrich) were annealed by heating to 80 °C, and gradually cooling down to room temperature, phosphorylated with T4 polynucleotide kinase (Roche) at 37 °C for 30 min, purified and then ligated into the pMT-GL3 vector. For cloning of 3’UTR and 5’ UTR reporters, the target site or the target site and an upstream BamHI site were cloned into the PmeI and SacII, or NotI and XhoI restriction sites, respectively. ORF reporters were constructed by cloning a Kozak sequence followed by the first 45 nucleotides of luciferase and the target site into XhoI and NcoI sites. Sequences of the oligonucleotides are provided in Supplementary Table S5. Where indicated, mutated firefly or Renilla luciferase versions were used that harbour synonymous mutations destroying the predicted target sites for tapiR1 (firefly luciferase: 782 gagtcgtcttaatgtatagatttgaagaa 810 mutated to 782 gtgtcgtgcttatgtaccggttcgaggag 810, and Renilla luciferase 462 tgaatggcctgatattgaagaa 483 mutated to 462 tgagtggccagatatcgaggag 483; modified nucleotides in bold).

Aag2 cells were seeded in 96-well plates the day before the experiment and transfected with 100 ng of the indicated plasmids and 100 ng pMT-Renilla⁵⁹ per well, using 2 μL X-tremeGENE HP DNA transfection reagent per 1 μg plasmid DNA according to the manufacturer’s instructions. Alternatively, 100 ng reporter plasmid and 100 ng pMT-Renilla were co-transfected with the indicated amounts of unlabelled, fully 2’O-methylated antisense RNA oligonucleotide using an additional amount of 4 μL X-tremeGENE HP DNA transfection reagent (Roche) per 1 μg oligonucleotide. Medium was replaced 3 h after reporter plasmid transfection with 0.5 mM CuSO₄ in fully supplemented Leibovitz’s L-15 medium to induce the metallothionein promoter. 24 h later, cells were lysed in 30 μL Passive lysis buffer (Promega) and activity of both luciferases was measured in 10 μL of the sample with the Dual Luciferase Reporter Assay system (Promega) on a Modulus Single Tube Reader (Turner Biosystems). Firefly luciferase was normalized to Renilla luciferase activity. For each construct, at least two to three independent clones were measured in triplicate.
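The normalization described above can be illustrated with a minimal sketch. It assumes paired firefly and Renilla readings from the same wells; the function names, the example luminescence values, and the additional step of expressing a target-site reporter relative to a control reporter are illustrative assumptions and are not taken from the original analysis.

```python
from statistics import mean


def normalized_activity(firefly, renilla):
    """Return the firefly/Renilla ratio for each well (paired readings)."""
    if len(firefly) != len(renilla):
        raise ValueError("firefly and renilla must have the same length")
    return [f / r for f, r in zip(firefly, renilla)]


def relative_to_control(reporter_ratios, control_ratios):
    """Mean reporter ratio divided by mean control ratio (assumed extra step)."""
    return mean(reporter_ratios) / mean(control_ratios)


# Hypothetical raw luminescence readings (arbitrary units), triplicate wells.
control = normalized_activity([1000.0, 1100.0, 900.0], [500.0, 550.0, 450.0])
target = normalized_activity([200.0, 220.0, 180.0], [500.0, 550.0, 450.0])
print(relative_to_control(target, control))
```

Dividing by the co-transfected Renilla signal corrects each well for differences in transfection efficiency and cell number before constructs are compared.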

RT-qPCR

1 μg of total RNA was treated with DNaseI (Ambion) for 45 min at 37 °C and reverse transcribed using the Taqman reverse transcription kit (Applied Biosystems) according to the manufacturer’s protocol. Real-time PCR was performed with the GoTaq qPCR Master Mix (Promega) and measured on a LightCycler480 instrument (Roche) with 5 min initial denaturation and 45 cycles of 5 s denaturation at 95 °C, 10 s annealing at 60 °C and 20 s amplification at 72 °C. Starting fluorescence values of specific mRNAs were calculated by linear regression of log fluorescence per cycle number using the LinRegPCR program, version 2015.3, as described in ref ⁶⁰.

3’ RACE

3’ Rapid Amplification of cDNA Ends (3’ RACE) was performed using the FirstChoice RLM-RACE Kit (Thermo Fisher Scientific) according to the manufacturer’s instructions. Amplification products were separated on agarose gel, purified and Sanger sequenced. Primer sequences can be found in Supplementary Table S5.

Blood feeding experiment

Naïve female Aedes aegypti (Liverpool strain) mosquitoes were offered human blood (Sanquin Blood Supply Foundation, Nijmegen, The Netherlands) through a Parafilm membrane using the Hemotek PS5 feeder (Discovery Workshops). Five engorged females were selected and sacrificed at each of the indicated time points. RNA was isolated as described above.

tapiR1 antisense oligonucleotide treatment and injection

Aag2 cells were seeded in 24-well plates the day before the experiment. Cells were treated with 500 nM 5’Cy5-labelled, fully 2’O-methylated antisense RNA oligonucleotide in 530 μL medium with 4 μL X-tremeGENE HP DNA transfection reagent (Roche) per 1 μg oligonucleotide. Medium was replaced after 3 h and cells from three independent experiments were harvested 48 h after transfection and prepared for RNA sequencing (see below).

For injection of embryos, engorged female mosquitoes that were kept at 25 °C and 70 % humidity were allowed to lay eggs for 45 min. Embryos were desiccated for 1.5 min, covered with Halocarbon oil (Sigma Aldrich) and injected with 50 μM 5’Cy5-labelled, fully 2’O-methylated antisense RNA oligonucleotide with a FemtoJet 4x (Eppendorf) with 1200 hPa pressure. Injected embryos were then transferred to a wet Whatman paper and kept at 27 °C and 80 % humidity for the indicated times. Per experiment, 50 to 150 embryos were injected per condition.

Scoring of embryo development and hatching

Injected embryos were allowed to develop for 2.5 days after injection on a moist Whatman paper and then fixed in 4 % paraformaldehyde for 8 h to overnight. Afterwards, the pigment of the endochorion was bleached with Trpis solution⁶¹ (0.037 M sodium chlorite, 1.45 M acetic acid) for 24 to 48 h. Embryos were washed five times in PBS and images were taken with an EVOS FL imaging system (Thermo Fisher Scientific). Embryos with evident larval segmentation (head, fused thoracic elements and abdomen) were scored as fully developed and embryos without any evident structure of the ooplasm as undeveloped. Individuals that showed first signs of structural rearrangements of the ooplasm, but did not complete larval segmentation, were scored as intermediate (see Supplementary Fig S1F). To avoid biases, the scoring was performed blindly. Hatching rate was determined from injected embryos 4 days post injection. Embryos were kept moist for two days and then allowed to slowly dry for the rest of the period. The embryos were transferred to water and then forced to hatch by applying negative pressure for a period of 30 min. The number of hatched L1 larvae was counted immediately afterwards.

Sequence logo

Repeat monomers from the satellite repeat loci in Ae. aegypti, Ae. albopictus, and Cx. quinquefasciatus were extracted manually from the current genome annotations obtained from Vectorbase (Aedes aegypti Liverpool AaegL5, Aedes albopictus Foshan AaloF1, Culex quinquefasciatus Johannesburg CpipJ2). A repeat unit was defined as the sequence starting from the first tapiR1 nucleotide until one nucleotide upstream of the next tapiR1 sequence. Sequences were aligned using MAFFT (v7.397)⁶² (with options --genafpair --leavegappyregion --kimura 1 --maxiterate 1000 --retree 1) and the sequence logo was constructed with the R package ggseqlogo⁶³.
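The repeat unit definition given above can be expressed programmatically. The following is a minimal sketch of that definition only; the monomers in this study were extracted manually. The function name, the toy locus, and the four-nucleotide anchor standing in for the tapiR1 sequence are hypothetical, and exact string matching of the anchor is an assumption.

```python
def split_repeat_units(locus, anchor):
    """Split a locus into repeat units that each begin at an anchor occurrence.

    A unit runs from the first nucleotide of one anchor up to, but not
    including, the first nucleotide of the next anchor. Sequence downstream
    of the last anchor has no closing anchor and is therefore not returned.
    """
    if not anchor:
        raise ValueError("anchor must be a non-empty sequence")
    starts = []
    pos = locus.find(anchor)
    while pos != -1:
        starts.append(pos)
        pos = locus.find(anchor, pos + 1)
    return [locus[a:b] for a, b in zip(starts, starts[1:])]


# Toy example: "ACGT" is a placeholder for the tapiR1 sequence.
units = split_repeat_units("TTACGTAAACGTCCCACGTGG", "ACGT")
print(units)
```

The resulting units could then be written to a FASTA file and passed to an aligner; diverged piRNA copies would require approximate rather than exact matching.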

Small RNA sequencing

Small RNAs from Aag2 cells (input) or PIWI immunoprecipitations were cloned with the TruSeq small RNA sample preparation kit (Illumina) according to the manufacturer’s instructions. For the input sample, size selected 19-33 nt small RNAs purified from polyacrylamide gel were used to construct the library as described previously⁶⁴, whereas IP samples were not extracted from gel.

Libraries were sequenced on an Illumina HiSeq 4000 instrument by Plateforme GenomEast (Strasbourg, France).

mRNA sequencing

RNA was isolated from Aag2 cells 48 h after AO transfection (three independent experiments), or embryos 20.5 h after AO injection (50 embryos pooled per experiment from five independent experiments) with RNAsolv reagent following standard phenol-chloroform extraction (see above). Polyadenylated RNAs were extracted and sequencing libraries were prepared using the TruSeq stranded mRNA Library Prep kit (Illumina) following the manufacturer’s instructions, and sequenced on an Illumina HiSeq 4000 instrument (2×50 bases).

Analysis of mRNA sequencing

Reads were mapped to the Ae. aegypti genome AaegL5 as provided by VectorBase (https://www.vectorbase.org) with STAR (version 2.5.2b)⁶⁵ in 2-pass mode: first mapping was done for all samples (options: --readFilesCommand zcat --outSAMtype None --outSAMattrIHstart 0 --outSAMstrandField intronMotif), identified splice junctions were combined (junctions located on the mitochondrial genome were filtered out, as these are likely false positives), and this list of junctions was used in a second round of mapping (with --sjdbFileChrStartEnd) and default parameters as above. Reads per gene were quantified with the additional option --quantMode GeneCounts. Alternatively, reads were quantified on TEfam transposon consensus sequences (https://tefam.biochem.vt.edu/tefam/get_fasta.php) with Salmon (v.0.8.2)⁶⁶, default settings and libType set to “ISR”. Statistical and further downstream analyses were performed with DESeq2⁶⁷ from Bioconductor. Significance was tested at an FDR of 0.01 and a log2 fold change of 0.5. tapiR1 target sites were predicted with the online tool from RNAhybrid⁶⁸ with helix constraints from nucleotide two to seven, and no G:U wobble allowed in the seed. Predictions were made on the AaegL5.1 geneset as provided by VectorBase, and on TEfam transposon consensus sequences. For Fig 4H, publicly available sequencing datasets⁶⁹ (accession numbers: SRR923702, SRR923826, SRR923837, SRR923853, SRR923704) were mapped and quantified as described above. Genes were categorized on the basis of their expression in embryos at 0-2 h vs. 12-16 h post egg-laying. Genes not detected in the 0-2 h sample were defined as purely zygotic, and genes that did not increase or decrease by more than log₂(0.5) as maternal stable. Genes that changed in expression by more than log₂(0.5), log₂(2), and log₂(5) from 0-2 h to 12-16 h were categorized as maternal unstable fraction (decreased expression), or as genes that are maternally provided but are transcribed by the zygote in addition to the preloaded maternal pool (increased expression). tapiR1 targets were defined as genes that were significantly upregulated at least two fold in tapiR1 AO injected embryos and harboured a predicted tapiR1 target site (mfe <= −24). The code will be made available on GitHub upon publication.
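The tapiR1 target definition above can be sketched as a simple filter. This is an illustration under stated assumptions rather than the analysis code: the record fields and gene identifiers are hypothetical, "significantly upregulated" is simplified to an adjusted p-value below 0.01, "at least two fold" is taken as a log2 fold change of at least 1, and the mfe is assumed to be given in kcal/mol for the best predicted site of each gene.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GeneResult:
    gene_id: str
    log2_fold_change: float    # tapiR1 AO versus control AO
    padj: float                # adjusted p-value from the differential test
    best_mfe: Optional[float]  # lowest predicted duplex mfe; None if no site


def is_tapir1_target(gene, fdr=0.01, min_log2fc=1.0, max_mfe=-24.0):
    """Apply the three criteria: significant, >= two fold up, mfe <= -24."""
    return (
        gene.padj < fdr
        and gene.log2_fold_change >= min_log2fc
        and gene.best_mfe is not None
        and gene.best_mfe <= max_mfe
    )


# Hypothetical results for three genes (identifiers are placeholders).
results = [
    GeneResult("geneA", 2.3, 1e-6, -27.5),   # passes all criteria
    GeneResult("geneB", 1.8, 1e-4, -20.1),   # predicted site too weak
    GeneResult("geneC", 0.4, 0.20, None),    # unchanged, no predicted site
]
targets = [g.gene_id for g in results if is_tapir1_target(g)]
print(targets)
```

Requiring both de-repression upon AO treatment and a predicted site reduces the chance that indirect expression changes are counted as direct targets.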

Analysis of small RNA sequencing

3’ sequencing adapters (TGGAATTCTCGGGTGCCAAGG) were trimmed from the sequence reads with Cutadapt (version 1.14)⁷⁰ and trimmed reads were mapped with Bowtie (version 0.12.7)⁷¹ to the Aedes aegypti LVP_AGWG genome sequence AaegL5.1 obtained from VectorBase with at most 1 mismatch. Reads that mapped to rRNAs or tRNAs were excluded from the analyses. Alternatively, 3’sequencing adapters ((NNN)TGGAATTCTCGGGTGCCAAGGC) and three random bases were trimmed from publicly available datasets from Ae. aegypti somatic and germline tissues⁷² (SRR5961503, SRR5961504, SRR5961505, SRR5961506) and then processed as described above. Oxidized libraries, IPs and input sample were normalized to the total number of mapped reads, all other libraries to the total number of miRNAs (in millions). piRNAs that were at least two fold enriched in a PIWI-IP compared to the corresponding input sample and were present with at least 10 rpm in the IP sample were considered PIWI-bound. Mapping positions were overlapped with basefeatures and repeatfeatures retrieved from VectorBase and counted with bedtools⁷³. Reads that mapped to two or more features were assigned to only one feature with the following hierarchy: open reading frames > non-coding RNAs (incl. lncRNAs, pseudogenes, snoRNAs, snRNAs, miRNAs) > LTR retrotransposons > Non-LTR retrotransposons (SINEs, LINEs, Penelope) > “Cut and paste” DNA transposons > other DNA transposons (Helitrons, MITEs) > satellite and tandem repeat features > DUST > other /unknown repeats. Accordingly, reads that mapped to a repeat feature and an intron or UTR were classified as repeat-derived, whereas all other reads mapping to introns or UTRs were considered as gene-derived. Positions not overlapping with any annotation were summarized as “other”. Results were then visualized with ggplot2⁷⁴ or Gviz⁷⁵ in R.
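The hierarchical assignment of multi-feature reads described above can be sketched as follows. The class labels are hypothetical names for the annotation categories listed in the text, and placing introns and UTRs at the lowest rank is an assumption that reproduces the stated rule that a read overlapping both a repeat and an intron or UTR is counted as repeat-derived.

```python
# Annotation classes ordered from highest to lowest priority.
HIERARCHY = [
    "open_reading_frame",
    "non_coding_rna",
    "ltr_retrotransposon",
    "non_ltr_retrotransposon",
    "cut_and_paste_dna_transposon",
    "other_dna_transposon",
    "satellite_or_tandem_repeat",
    "dust",
    "other_unknown_repeat",
    "intron_or_utr",  # gene-derived only if no repeat class overlaps
]
RANK = {name: i for i, name in enumerate(HIERARCHY)}


def assign_feature(overlapping):
    """Return the highest-priority class among the overlapping features.

    Labels that are not part of HIERARCHY are ignored; a read without any
    recognised overlap is summarized as "other".
    """
    ranked = [f for f in overlapping if f in RANK]
    if not ranked:
        return "other"
    return min(ranked, key=RANK.__getitem__)


print(assign_feature({"satellite_or_tandem_repeat", "intron_or_utr"}))
print(assign_feature({"ltr_retrotransposon", "open_reading_frame"}))
print(assign_feature(set()))
```

Assigning each read to exactly one class keeps the class fractions additive, so that they sum to the total number of mapped reads.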

The code will be made available on GitHub upon publication.

Data availability

Raw sequence data have been deposited in the NCBI Sequence Read Archive under the BioProject number PRJNA482553.

Supplementary Information

Supplementary Table S1: Differentially expressed genes upon tapiR1 AO treatment in Aag2 cells.

Supplementary Table S2: Differentially expressed transposable elements upon tapiR1 AO treatment in Aag2 cells.

Supplementary Table S3: Differentially expressed genes upon tapiR1 AO treatment in embryos.

Supplementary Table S4: Differentially expressed transposable elements upon tapiR1 AO treatment in embryos.

Supplementary Table S5: Oligonucleotide sequences used in this study.

Author contributions

R.H., P.M., and R.P.v.R designed the experiments and analyzed the data. R.H. performed the computational analyses and most of the experiments, except for PIWI-IPs for small RNA sequencing (J.J. and E.T.), design and validation of PIWI antibodies (B.P.), and tissue isolations and blood feeding experiment (C.B.F.V. and C.J.K.). S.H.M. and L.L. helped with optimizing embryo injections. R.H. and R.P.v.R. wrote the paper. All authors read and contributed to the manuscript.

Author information

The authors declare no competing financial interests.

Acknowledgments

We thank past and current members of the Van Rij laboratory for discussions. We are grateful to Anna Beth Crist and Artem Baidaliuk for their help with mosquito rearing and embryo injections, and to Catherine Bourgouin and Nicolas Puchot for assistance with the microinjection apparatus. We thank Bas Dutilh for his support with analyzing target site enrichments, and Geert-Jan van Gemert for kindly providing An. stephensi mosquitoes. Tim Möhlmann is acknowledged for providing wild-caught mosquito samples.

This work is financially supported by a Consolidator Grant from the European Research Council under the European Union’s Seventh Framework Programme (grant number ERC CoG 615680) and a VICI grant from the Netherlands Organization for Scientific Research (grant number 016.VICI.170.090). A stay of R.H. at Pasteur Institute, Paris, France was supported by ERASMUS+.

References

Garrido-Ramos, M. A. Satellite DNA: An Evolving Topic. Genes (Basel) 8, doi:10.3390/genes8090230 (2017).
Brajkovic, J., Feliciello, I., Bruvo-Madaric, B. & Ugarkovic, D. Satellite DNA-like elements associated with genes within euchromatin of the beetle Tribolium castaneum. G3 (Bethesda) 2, 931–941, doi:10.1534/g3.112.003467 (2012).
Kuhn, G. C., Kuttler, H., Moreira-Filho, O. & Heslop-Harrison, J. S. The 1.688 repetitive DNA of Drosophila: concerted evolution at different genomic scales and association with genes. Mol Biol Evol 29, 7–11, doi:10.1093/molbev/msr173 (2012).
Girard, A., Sachidanandam, R., Hannon, G. J. & Carmell, M. A. A germline-specific class of small RNAs binds mammalian Piwi proteins. Nature 442, 199–202, doi:10.1038/nature04917 (2006).
Lau, N. C. et al. Characterization of the piRNA complex from rat testes. Science 313, 363–367, doi:10.1126/science.1130164 (2006).
Melters, D. P. et al. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution. Genome Biol 14, R10, doi:10.1186/gb-2013-14-1-r10 (2013).
Kit, S. Equilibrium sedimentation in density gradients of DNA preparations from animal tissues. J Mol Biol 3, 711–716 (1961).
Sueoka, N. Variation and heterogeneity of base composition of deoxyribonucleic acids: a compilation of old and new data. J. Mol. Biol. 3, 31–40 (1961).
Hall, I. M. et al. Establishment and maintenance of a heterochromatin domain. Science 297, 2232–2237, doi:10.1126/science.1076466 (2002).
Menon, D. U., Coarfa, C., Xiao, W., Gunaratne, P. H. & Meller, V. H. siRNAs from an X-linked satellite repeat promote X-chromosome recognition in Drosophila melanogaster. Proc Natl Acad Sci U S A 111, 16460–16465, doi:10.1073/pnas.1410534111 (2014).
Zakrzewski, F. et al. Epigenetic profiling of heterochromatic satellite DNA. Chromosoma 120, 409–422, doi:10.1007/s00412-011-0325-x (2011).
Fukagawa, T. et al. Dicer is essential for formation of the heterochromatin structure in vertebrate cells. Nat Cell Biol 6, 784–791, doi:10.1038/ncb1155 (2004).
Kanellopoulou, C. et al. Dicer-deficient mouse embryonic stem cells are defective in differentiation and centromeric silencing. Genes Dev 19, 489–501, doi:10.1101/gad.1248505 (2005).
Pezer, Z. & Ugarkovic, D. Satellite DNA-associated siRNAs as mediators of heat shock response in insects. RNA Biol 9, 587–595, doi:10.4161/rna.20019 (2012).
May, B. P., Lippman, Z. B., Fang, Y., Spector, D. L. & Martienssen, R. A. Differential regulation of strand-specific transcripts from Arabidopsis centromeric satellite repeats. PLoS Genet 1, e79, doi:10.1371/journal.pgen.0010079 (2005).
Volpe, T. A. et al. Regulation of heterochromatic silencing and histone H3 lysine-9 methylation by RNAi. Science 297, 1833–1837, doi:10.1126/science.1074973 (2002).
Matthews, B. J. et al. Improved Aedes aegypti mosquito reference genome assembly enables biological discovery and vector control. bioRxiv (2017).
Siomi, M. C., Sato, K., Pezic, D. & Aravin, A. A. PIWI-interacting small RNAs: the vanguard of genome defence. Nat Rev Mol Cell Biol 12, 246–258, doi:10.1038/nrm3089 (2011).
Czech, B. & Hannon, G. J. One Loop to Rule Them All: The Ping-Pong Cycle and piRNA-Guided Silencing. Trends Biochem Sci 41, 324–337, doi:10.1016/j.tibs.2015.12.008 (2016).
Arensburger, P., Hice, R. H., Wright, J. A., Craig, N. L. & Atkinson, P. W. The mosquito Aedes aegypti has a large genome size and high transposable element load but contains a low proportion of transposon-specific piRNAs. BMC Genomics 12, 606, doi:10.1186/1471-2164-12-606 (2011).
Kawaoka, S., Izumi, N., Katsuma, S. & Tomari, Y. 3’ end formation of PIWI-interacting RNAs in vitro. Mol Cell 43, 1015–1022, doi:10.1016/j.molcel.2011.07.029 (2011).
Saito, K. et al. Pimet, the Drosophila homolog of HEN1, mediates 2’-O-methylation of Piwi-interacting RNAs at their 3’ ends. Genes Dev 21, 1603–1608, doi:10.1101/gad.1563607 (2007).
Horwich, M. D. et al. The Drosophila RNA methyltransferase, DmHen1, modifies germline piRNAs and single-stranded siRNAs in RISC. Curr Biol 17, 1265–1272, doi:10.1016/j.cub.2007.06.030 (2007).
Miesen, P., Joosten, J. & van Rij, R. P. PIWIs Go Viral: Arbovirus-Derived piRNAs in Vector Mosquitoes. PLoS Pathog 12, e1006017, doi:10.1371/journal.ppat.1006017 (2016).
Miesen, P., Girardi, E. & van Rij, R. P. Distinct sets of PIWI proteins produce arbovirus and transposon-derived piRNAs in Aedes aegypti mosquito cells. Nucleic Acids Res 43, 6545–6556, doi:10.1093/nar/gkv590 (2015).
Schnettler, E. et al. Knockdown of piRNA pathway proteins results in enhanced Semliki Forest virus production in mosquito cells. J Gen Virol 94, 1680–1689, doi:10.1099/vir.0.053850-0 (2013).
Varjak, M. et al. Aedes aegypti Piwi4 Is a Noncanonical PIWI Protein Involved in Antiviral Responses. mSphere 2, doi:10.1128/mSphere.00144-17 (2017).
Plohl, M. et al. Long-term conservation vs high sequence divergence: the case of an extraordinarily old satellite DNA in bivalve mollusks. Heredity (Edinb) 104, 543–551, doi:10.1038/hdy.2009.141 (2010).
Li, Y. X. & Kirby, M. L. Coordinated and conserved expression of alphoid repeat and alphoid repeat-tagged coding sequences. Dev Dyn 228, 72–81, doi:10.1002/dvdy.10355 (2003).
Martinez-Lage, A., Rodriguez-Farina, F., Gonzalez-Tizon, A. & Mendez, J. Origin and evolution of Mytilus mussel satellite DNAs. Genome 48, 247–256, doi:10.1139/g04-115 (2005).
Henikoff, S., Ahmad, K. & Malik, H. S. The centromere paradox: stable inheritance with rapidly evolving DNA. Science 293, 1098–1102, doi:10.1126/science.1062939 (2001).
Plohl, M., Mestrovic, N. & Mravinac, B. Satellite DNA evolution. Genome Dyn 7, 126–152, doi:10.1159/000337122 (2012).
Reidenbach, K. R. et al. Phylogenetic analysis and temporal diversification of mosquitoes (Diptera: Culicidae) based on nuclear genes and morphology. BMC Evol Biol 9, 298, doi:10.1186/1471-2148-9-298 (2009).
Chaves, R., Ferreira, D., Mendes-da-Silva, A., Meles, S. & Adega, F. FA-SAT Is an Old Satellite DNA Frozen in Several Bilateria Genomes. Genome Biol Evol 9, 3073–3087, doi:10.1093/gbe/evx212 (2017).
Bartel, D. P. MicroRNAs: target recognition and regulatory functions. Cell 136, 215–233, doi:10.1016/j.cell.2009.01.002 (2009).
Zhang, D. et al. The piRNA targeting rules and the resistance to piRNA silencing in endogenous genes. Science 359, 587–592, doi:10.1126/science.aao2840 (2018).
Shen, E. Z. et al. Identification of piRNA Binding Sites Reveals the Argonaute Regulatory Landscape of the C. elegans Germline. Cell 172, 937–951 e918, doi:10.1016/j.cell.2018.02.002 (2018).
Matsumoto, N. et al. Crystal Structure of Silkworm PIWI-Clade Argonaute Siwi Bound to piRNA. Cell 167, 484–497 e489, doi:10.1016/j.cell.2016.09.002 (2016).
Wang, Y. et al. Structure of an argonaute silencing complex with a seed-containing guide DNA and target RNA duplex. Nature 456, 921–926, doi:10.1038/nature07666 (2008).
Wang, Y., Sheng, G., Juranek, S., Tuschl, T. & Patel, D. J. Structure of the guide-strand-containing argonaute silencing complex. Nature 456, 209–213, doi:10.1038/nature07315 (2008).
Mohn, F., Handler, D. & Brennecke, J. Noncoding RNA. piRNA-guided slicing specifies transcripts for Zucchini-dependent, phased piRNA biogenesis. Science 348, 812–817, doi:10.1126/science.aaa1039 (2015).
Reuter, M. et al. Miwi catalysis is required for piRNA amplification-independent LINE1 transposon silencing. Nature 480, 264–267, doi:10.1038/nature10672 (2011).
Goh, W. S. et al. piRNA-directed cleavage of meiotic transcripts regulates spermatogenesis. Genes Dev 29, 1032–1044, doi:10.1101/gad.260455.115 (2015).
Feliciello, I., Akrap, I. & Ugarkovic, D. Satellite DNA Modulates Gene Expression in the Beetle Tribolium castaneum after Heat Stress. PLoS Genet 11, e1005466, doi:10.1371/journal.pgen.1005466 (2015).
Grimson, A. et al. MicroRNA targeting specificity in mammals: determinants beyond seed pairing. Mol Cell 27, 91–105, doi:10.1016/j.molcel.2007.06.017 (2007).
Ugarkovic, D. Functional elements residing within satellite DNAs. EMBO Rep 6, 1035–1039, doi:10.1038/sj.embor.7400558 (2005).
Tadros, W. & Lipshitz, H. D. The maternal-to-zygotic transition: a play in two acts. Development 136, 3033–3042, doi:10.1242/dev.033183 (2009).
Thomsen, S., Anders, S., Janga, S. C., Huber, W. & Alonso, C. R. Genome-wide analysis of mRNA decay patterns during early Drosophila development. Genome Biol 11, R93, doi:10.1186/gb-2010-11-9-r93 (2010).
Giraldez, A. J. et al. Zebrafish MiR-430 promotes deadenylation and clearance of maternal mRNAs. Science 312, 75–79, doi:10.1126/science.1122689 (2006).
Lund, E., Liu, M., Hartley, R. S., Sheets, M. D. & Dahlberg, J. E. Deadenylation of maternal mRNAs mediated by miR-427 in Xenopus laevis embryos. RNA 15, 2351–2363, doi:10.1261/rna.1882009 (2009).
Bushati, N., Stark, A., Brennecke, J. & Cohen, S. M. Temporal reciprocity of miRNAs and their targets during the maternal-to-zygotic transition in Drosophila. Curr Biol 18, 501–506, doi:10.1016/j.cub.2008.02.081 (2008).
Rouget, C. et al. Maternal mRNA deadenylation and decay by the piRNA pathway in the early Drosophila embryo. Nature 467, 1128–1132, doi:10.1038/nature09465 (2010).
Barckmann, B. et al. Aubergine iCLIP Reveals piRNA-Dependent Decay of mRNAs Involved in Germ Cell Development in the Early Embryo. Cell Rep 12, 1205–1216, doi:10.1016/j.celrep.2015.07.030 (2015).
Fansiri, T. et al. Genetic mapping of specific interactions between Aedes aegypti mosquitoes and dengue viruses. PLoS Genet 9, e1003621, doi:10.1371/journal.pgen.1003621 (2013).
Goertz, G. P., Vogels, C. B. F., Geertsema, C., Koenraadt, C. J. M. & Pijlman, G. P. Mosquito co-infection with Zika and chikungunya virus allows simultaneous transmission without affecting vector competence of Aedes aegypti. PLoS Negl Trop Dis 11, e0005654, doi:10.1371/journal.pntd.0005654 (2017).
Mohlmann, T. W. R. et al. Community analysis of the abundance and diversity of mosquito species (Diptera: Culicidae) in three European countries at different latitudes. Parasit Vectors 10, 510, doi:10.1186/s13071-017-2481-1 (2017).
Joosten, J. et al. The Tudor protein Veneno assembles the ping-pong amplification complex that produces viral piRNAs in Aedes mosquitoes. bioRxiv, doi:10.1101/242305 (2018).
Pall, G. S. & Hamilton, A. J. Improved northern blot method for enhanced detection of small RNA. Nat Protoc 3, 1077–1084, doi:10.1038/nprot.2008.67 (2008).
van Rij, R. P. et al. The RNA silencing endonuclease Argonaute 2 mediates specific antiviral immunity in Drosophila melanogaster. Genes Dev 20, 2985–2995, doi:10.1101/gad.1482006 (2006).
Ramakers, C., Ruijter, J. M., Deprez, R. H. & Moorman, A. F. Assumption-free analysis of quantitative real-time polymerase chain reaction (PCR) data. Neurosci Lett 339, 62–66 (2003).
Trpis, M. A new bleaching and decalcifying method for general use in zoology. Can J Zool 48, 892–893 (1970).
Katoh, K., Misawa, K., Kuma, K. & Miyata, T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 30, 3059–3066 (2002).
Wagih, O. ggseqlogo: a versatile R package for drawing sequence logos. Bioinformatics 33, 3645–3647, doi:10.1093/bioinformatics/btx469 (2017).
van Cleef, K. W. et al. Mosquito and Drosophila entomobirnaviruses suppress dsRNA- and siRNA-induced RNAi. Nucleic Acids Res 42, 8732–8744, doi:10.1093/nar/gku528 (2014).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21, doi:10.1093/bioinformatics/bts635 (2013).
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods 14, 417–419, doi:10.1038/nmeth.4197 (2017).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550, doi:10.1186/s13059-014-0550-8 (2014).
Rehmsmeier, M., Steffen, P., Hochsmann, M. & Giegerich, R. Fast and effective prediction of microRNA/target duplexes. RNA 10, 1507–1517, doi:10.1261/rna.5248604 (2004).
Akbari, O. S. et al. The developmental transcriptome of the mosquito Aedes aegypti, an invasive species and major arbovirus vector. G3 (Bethesda) 3, 1493–1509, doi:10.1534/g3.113.006742 (2013).
Martin, M. Cutadapt removes adapters from high-throughput sequencing reads. EMBnet.journal 17, 10–12, doi:10.14806/ej.17.1.200 (2011).
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10, R25, doi:10.1186/gb-2009-10-3-r25 (2009).
Lewis, S. H. et al. Pan-arthropod analysis reveals somatic piRNAs as an ancestral defence against transposable elements. Nat Ecol Evol 2, 174–181, doi:10.1038/s41559-017-0403-4 (2018).
Quinlan, A. R. BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr Protoc Bioinformatics 47, 11.12.1–11.12.34, doi:10.1002/0471250953.bi1112s47 (2014).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis. (Springer-Verlag New York, 2016).
Hahne, F. & Ivanek, R. in Statistical Genomics. Methods in Molecular Biology Vol. 1418 (eds E. Mathé & S. Davis) (Humana Press, New York, NY, 2016).

Posted January 15, 2020.

Subject Area

Molecular Biology
