Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene content evolution in the Viannia subgenus

View ORCID ProfileSimone Coughlan, Ali Shirley Taylor, Eoghan Feane, Mandy Sanders, Gabriele Schonian, View ORCID ProfileJames A. Cotton, Tim Downing
doi: https://doi.org/10.1101/233148
Simone Coughlan
1School of Mathematics, Applied Mathematics and Statistics, National University of Ireland, Galway, Ireland.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Simone Coughlan
Ali Shirley Taylor
2School of Biotechnology, Dublin City University, Dublin, Ireland.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Eoghan Feane
2School of Biotechnology, Dublin City University, Dublin, Ireland.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mandy Sanders
3Wellcome Trust Sanger Institute, Hinxton, UK.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gabriele Schonian
4Charité University Medicine, Berlin, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
James A. Cotton
3Wellcome Trust Sanger Institute, Hinxton, UK.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for James A. Cotton
Tim Downing
1School of Mathematics, Applied Mathematics and Statistics, National University of Ireland, Galway, Ireland.
2School of Biotechnology, Dublin City University, Dublin, Ireland.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

The unicellular protozoan parasite Leishmania causes the neglected tropical disease leishmaniasis, affecting 12 million people in 98 countries. In South America where the Viannia subgenus predominates, so far only L. (Viannia) braziliensis and L. (V.) panamensis have been sequenced, assembled and annotated as reference genomes. Addressing this deficit in molecular information can inform species typing, epidemiological monitoring and clinical treatment. Here, L. (V.) naiffi and L. (V.) guyanensis genomic DNA was sequenced to assemble these two genomes as draft references from short sequence reads. The methods used were tested using short sequence reads for L. braziliensis M2904 against its published reference as a comparison. This assembly and annotation pipeline identified 70 additional genes not annotated on the original M2904 reference. Phylogenetic and evolutionary comparisons of L. guyanensis and L. naiffi with ten other Viannia genomes revealed four traits common to all Viannia: aneuploidy, 22 orthologous groups of genes absent in other Leishmania subgenera, elevated TATE transposon copies, and a high NADH-dependent fumarate reductase gene copy number. Within the Viannia, there were limited structural changes in genome architecture specific to individual species: a 45 Kb amplification on chromosome 34 was present in most, L. naiffi had a higher copy number of the virulence factor leishmanolysin, and laboratory isolate L. shawi M8408 had a possible minichromosome derived from chromosome 34. This combination of genome assembly, phylogenetics and comparative analysis across an extended panel of diverse Viannia has uncovered new insights into the origin and evolution of this subgenus and can help improve diagnostics for leishmaniasis surveillance.

Introduction

Most cutaneous (CL) and mucosal leishmaniasis (ML) cases in the Americas are the result of infection by Leishmania parasites belonging to the Viannia subgenus. The complexity of the molecular, epidemiological and ecological challenges associated with Leishmania in South America remains opaque due to our limited understanding of the biology of Viannia parasites. Nine Viannia (sub)species have been described so far: L. (V.) braziliensis, L. (V.)peruviana, L. (V.) guyanensis, L. (V.)panamensis, L. (V.) shawi, L. (V.) lainsoni, L. (V.) naiffi, L. (V.) lindenbergi and L. (V.) utingensis. CL and ML are endemic in 18 out 20 countries in the Americas [1] and are mainly associated with L. braziliensis, L. guyanensis, and L. panamensis, whose frequency varies geographically. Other species are less frequently associated with human disease, and some are restricted to certain areas [2].

Human CL is partially driven by transmission from sylvatic and peridomestic mammalian reservoirs [3], via sandflies of the genus Lutzomyia in the Americas, distinct from Phlebotomus sandflies in the Old World [4]. Although CL has spread to domestic and peridomestic niches due to migration, new settlements and deforestation [5–7], there is still a strong association of some Leishmania with sylvatic envinroments, such that human infection is accidentally acquired due to direct contact when handling livestock [8]. L. naiffi and L. guyanensis are among the Viannia species that are adapting to environments altered by humans, show variable responses to treatment, and diversity in the types of clinical manifestations presented.

L. naiffi was formally described from a parasite isolated in 1989 from its primary reservoir, the nine-banded armadillo (Dasypus novemcinctus), in Pará state of northern Brazil [9-11]. L. naiffi was initially placed in the Viannia subgenus based on its molecular and immunological characteristics [9]. Many phlebotomine species are likely to participate in the transmission of L. naiffi in Amazonia [12], including Lu. (Psathyromyia) ayrozai and Lu. (Psychodopygus) paraensis in Brazil [13], Lu. (Psathyromyia) squamiventris and Lu. tortura in Ecuador [14], and Lu. trapidoi and Lu. gomezi in Panama [30]. L. naiffi has been isolated from humans and armadillos [9–10], and detected in Thrichomys pachyurus rodents found in the same habitat as D. novemcinctus in Brazil [16]. The nine-banded armadillo is hunted, handled and consumed in the Americas and is regarded as a pest [11, 17-18]. People in the same vector range as these armadillos could be exposed to infective sandflies: three L. naiffi CL cases followed contact with armadillos in Suriname [19]. L. naiffi causes localised CL in humans with small discrete lesions on the hands, arms or legs [10, 20-21], which has been observed in Brazil, French Guiana, Ecuador, Peru and Suriname [19, 22]. CL due to L. naiffi usually responds to treatment [10, 22] and can be self-limiting [23], though poor response to antimonial or pentamidine therapy was reported in two patients in Manaus, Brazil [20].

L. guyanensis was first described in 1954 [24] and its primary hosts are the forest dwelling two-toed sloth (Choloepus didactylus) and the lesser anteater Tamandua tetradactyl [25]. Potential secondary reservoirs of L. guyanensis are Didelphis marsupialis (the common opossum) [26, 27], rodents from the genus Proechimys [25], Marmosops incanus (the grey slender opposum) [28] in Brazil, and D. novemcinctus [29]. Lu. umbratilis, Lu. anduzei and Lu. whitmani are prevalent in forests [30] and act as vectors of L. guyanensis [31-33]. L. guyanensis has been found in French Guiana, Bolivia, Brazil, Colombia, Guyana, Venezuela, Ecuador, Peru, Argentina and Suriname [34-39].

More precise genetic screening of Viannia isolates is necessary to trace hybridisation between species. Infection of humans, dogs and Lu. ovallesi with L. guyanensis/L. braziliensis hybrids was reported in Venezuela [40-41]. A L. shawi/L. guyanensis hybrid causing CL was detected in Amazonian Brazil [42], and L. naiffi has produced viable progeny with L. lainsoni [43] and L. braziliensis (Elisa Cupolillo, unpublished data). There is extensive evidence of interbreeding among L. braziliensis complex isolates, including more virulent L. braziliensis/L. peruviana hybrids with higher survival rates within hosts in vitro [44].

Leishmania genomes are characterised by several key features. Genes are organised as polycistronic transcription units that have a high degree of synteny across Leishmania species [45]. These polycistronic transcription units are co-transcribed by RNA polymerase II as polycistronic pre-mRNAs that are 5’-transpliced and 3’-polyadenylated [46, 47]. This means translation and stability of these mature mRNAs determines gene expression rather than transcription rates. In addition, Leishmania display extensive aneuploidy, frequently possess extrachromosomal amplifications driven by homologous recombination at repetitive sequences, and have variable gene copy numbers [48]. The Leishmania subgenus genomes of L. infantum, L. donovani, and L. major have 36 chromosomes [49], whereas Viannia genomes have 35 chromosomes due to a fusion of chromosomes 20 and 34 [45, 50]. In contrast to the species of the Leishmania subgenus, Viannia parasites possess encoding functioning RNA interference (RNAi) machinery that may mediate infecting viruses and transposable elements [51].

Fully annotated genomes have been described in detail for only two Viannia species: L. panamensis [51] and L. braziliensis [45, 48], limiting our comprehension of their evolutionary origin, genetic diversity and molecular function. Consequently, we present reference genomes of L. guyanensis LgCL085 and L. naiff LnCL223 to address these critical gaps. These new annotated references genomes were compared to genomes of other Viannia species to examine structural variation, sequence divergence, gene synteny and chromosome copy number changes. We contrasted the genomic architecture of L. guyanensis LgCL085 and L. naiffi LnCL223 with the L. braziliensis MHOM/BR/1975/M2903 assembly, two unannotated L. peruviana chromosome-level scaffold assemblies [52], the L. panamensis MHOM/PA/1994/PSC-1 reference and the L. braziliensis MHOM/BR/1975/M2904 reference. Furthermore, we assessed aneuploidy in five unassembled Viannia datasets isolated from humans, armadillos and primates which are commonly used in studies on Viannia parasites [53–56]: L. shawi reference isolate MCEB/BR/1984/M8408 also known as IOC_L1545, L. guyanensis MHOM/BR/1975/M4147 (iz34), L. naiffi MDAS/BR/1979/M5533 (IOC_L1365), L. lainsoni MHOM/BR/1981/M6426 (IOC_L1023), L. panamensis MHOM/PA/1974/WR120 [53] (IOC stands for Instituto Oswaldo Cruz).

Results

Genome assembly from short-reads

The genomes of L. guyanensis LgCL085 and L. naiffi LnCL223 were assembled from short reads, along with an assembly of L. braziliensis M2904 generated in the same way as a positive control [48] (Table 1). This facilitated comparison with the published M2904 genome which was assembled by capillary sequencing of a plasmid clone library together with extensive finishing work and with fosmid end sequencing [45], so that the ability of short reads to correctly and comprehensively resolve Leishmania genome architecture could be quantified.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1:

Data used in this study. The World Health Organisation (WHO) numbers are structured such that M is mammal, R is reptile, HOM is Homo, CAN is canine, DAS is Dasypus (an armadillo), CEB is Cebus (a primate), ARV is Arvicanthis (a rodent), TAR is Tarentolae and LAT is Latastia (a long-tailed lizard). The top two rows indicate the isolates for L. guyanensis and L. naiffi genomes published here. * SRA stands for SRA or TriTrypDB accession ID.

Firstly, the L. guyanensis LgCL085, L. naiffi LnCL223 and the L. braziliensis M2904 control reads were filtered to remove putative contaminant sequences identified by aberrant GC content, trimmed at the 3’ ends to remove low quality bases, and PCR primer sequences were removed (see Methods for details) resulting in 26,067,692 properly paired reads for L. guyanensis, 13,979,628 for L. naiffi, 34,592,618 for the L. braziliensis control (Table S1). These filtered reads for L. guyanensis, L. naiffi and L. braziliensis were de novo assembled into contigs using Velvet [57] with k-mers of 61 for L. guyanensis, 43 for L. naiffi and 43 for the L. braziliensis control optimised for each library.

The initial contigs were scaffolded using read pair information with SSPACE [58] to yield 2,800 L. guyanensis scaffolds with an N50 of 95.4 Kb, 6,530 L. naiffi scaffolds with an N50 of 24.3 Kb, and 3,782 L. braziliensis scaffolds with an N50 of 20.6 Kb (Table 2). The corrected scaffolds for L. guyanensis, L. naiffi and the L. braziliensis control were contiguated (aligned, ordered and oriented) using the extensively finished L. braziliensis M2904 reference with ABACAS [59]. The output was split into 35 pseudo-chromosomes and REAPR [60] broke scaffolds at possible misassemblies to assess contiguation accuracy. The pseudo-chromosome lengths of each sample approximated the length of each corresponding L. braziliensis M2904 reference chromosome with the exceptions of shorter L. guyanensis chromosomes 2, 4, 12 and 21, and the longer L. naiffi chromosome 1 (Figure S1). Post-assembly alignment of all bin contigs using BLASTn identified 44 L. guyanensis sequences spanning 4,566,791 bp as putative contaminants that were removed: half had high similarity to bacterium Niastella koreensis (Table S2).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2:

Summary of L. braziliensis reference M2904, L. braziliensis control, L. guyanensis LgCL085, and L. naiffi LnCL223 genome assembly contigs, scaffolds, gaps, read coverage, assembled chromosomal and contig sequence, and levels of gene annotation.

When the reads for each were mapped to its own assembled genome, the median read coverage was 56 for L. guyanensis, 36 for L. naiffi and 75 for the L. braziliensis control. The latter was on par with the 74-fold median coverage observed when M2904 short reads were mapped to the L. braziliensis reference [45, 48] (Table S3). The differing coverage levels correlated with the numbers of gaps in the final genome assembly of L. guyanensis (1,557, Table 2) and L. naiffi (3,853).

MLSA of L. guyanensis LgCL085 and L. naiffi LnCL223 with the Viannia subgenus

As a first step in investigating the genetic origins of these isolates, we examined their species identity using MLSA (multi-locus sequencing analysis). Four housekeeping gene sequences published for 95 Viannia isolates including L. braziliensis, L. lainsoni, L. lindenbergi, L. utingensis, L. guyanensis, L. shawi and L. naiffi [56] were compared with orthologs of each gene extracted from assemblies of L. naiffi LnCL223, L. guyanensis LgCL085, the L. braziliensis reference, L. panamensis PSC-1 and L. peruviana PAB-4377. Among the 95 were four samples with reads available [53]: L. shawi MCEB/BR/1984/M8408 (IOC_L1545), L. guyanensis MHOM/BR/1975/M4147 (iz34), L. naiffi MDAS/BR/1979/M5533 (IOC_L1365) and L. lainsoni MHOM/BR/1981/M6426 (IOC_L1023). The genes were aligned using Clustal Omega v1.1 [61] to create a network for the 102 isolates with SplitsTree v4.13.1 [62]. This replicated the expected highly reticulated structure [56] where L. braziliensis M2904 and L. peruviana PAB-4377 were in the L. braziliensis cluster (Figure 1).

Figure 1:
  • Download figure
  • Open in new tab
Figure 1: Middle: A neighbor-Net network of the uncorrected p-distances from concatenated 2,902-base sequences from four housekeeping genes for 102 Viannia samples. The genes were glucose-6-phosphate dehydrogenase (G6PD), 6-phosphogluconate dehydrogenase (6PGD), mannose phosphate isomerase (MPI) and isocitrate dehydrogenase (ICD). L. naiffi LnCL223 (cyan) is “New_L_naiffi_Reference” and is related to M5533 (IOC_L1365). L. guyanensis LgCL085 (blue) is “New_L_guyanensis_Reference” and is related to the L. shawi M8408 (IOC_L1545) assembly and the L. panamensis PSC-1 genome, but less so to L. guyanensis M4147 (iz34). The L. braziliensis M2904 reference and control are “M2904_Reference” and “M2904_Control”, proximal to L. peruviana PAB-4377. L. lainsoni M6426 (IOC_L1023) (green), L. utingensis (orange) and L. lindenbergi (pink) are shown. The isolate names and detail for each species complex is shown by insets in red (L. braziliensis), dark blue (L. guyanensis) and light blue (L. naiffi). For detailed viewing, the nexus file can be downloaded at https://figshare.com/s/eecf1c6b42ac4deb6acc and highresolution PDF at https://doi.org/10.6084/m9.figshare.5687329.

Previous work suggests that the L. guyanensis species complex includes L. panamensis and L. shawi because they show little genetic differentiation from one another [56, 63-65]. The MLSA here showed that the new L. guyanensis LgCL085 reference clustered phylogenetically in the L. guyanensis species complex, and had no sequence differences compared to L. panamensis PSC-1 and seven relative to L. shawi M8408 across the 2,902 sites aligned (Figure 1). L. guyanensis LgCL085 grouped with isolates classified as zymodeme Z26 by multilocus enzyme electrophoresis (MLEE) associated with L. shawi [54]. This was supported by the number and the alleles of genome-wide SNPs called using reads mapped to the L. braziliensis M2904 reference for L. guyanensis (355,267 SNPs), L. guyanensis M4147 (326,491), L. panamensis WR120 (294,459) and L. shawi M8408 (296,095) (Table S4).

The L. naiffi LnCL223 was closest to L. naiffi ISQU/BR/1994/IM3936, with two differences. It clustered with MLEE zymodeme Z49 based on the correspondence between the MLSA network and previously typed zymodemes, though L. naiffi is associated with more zymodemes than other Viannia. The number and the alleles of genome-wide SNPs called using reads mapped to the L. braziliensis reference were similar for L. naiffi (548,256) and M5533 (633,560) (Table S4) and consistent with the MLSA genetic distances.

There was no evidence of recent gene flow between these three species at any genome-wide 10 Kb segment and L. naiffi LnCL223 had fewer SNPs compared to L. braziliensis M2904 than L. guyanensis LgCL085 (Figure S2). Linking the genetic distance data with the MLSA network and previous work [56, 63-65], four genetically distinct species complexes are represented by the genome-sequenced Viannia: (i) braziliensis including L. peruviana, (ii) guyanensis including L. panamensis and L. shawi, (iii) naiffi, and (iv) lainsoni (Table S4), and the less explored (v) lindenbergi and (vi) utingensis complexes (Figure 1).

Ancestral diploidy and constitutive aneuploidy in Viannia

The normalised chromosomal coverage of the L. guyanensis LgCL085 and L. naiffi LnCL223 reads mapped to L. braziliensis M2904 showed aneuploidy on a background of a diploid genome (Figure 2). The coverage levels of reads for L. peruviana LEM1537, L. peruviana PAB-4377, L. panamensis PSC-1 and the triploid L. braziliensis control mapped to the M2904 reference, confirmed previous work (Figure S3), including the L. braziliensis control (Figure S4), and demonstrated that assemblies from short read data were sufficient to estimate chromosome copy number differences. Repeating this for L. shawi M8408, L. naiffi M5533, L. guyanensis M4147, L. panamensis WR120 and L. lainsoni M6426 showed that all these Viannia were predominantly disomic and thus diploidy was the likely ancestral state of this subgenus (Figure 2).

Figure 2:
  • Download figure
  • Open in new tab
Figure 2: Normalised chromosome copy numbers of L. naiffi LnCL223 reads mapped to its assembly, L. guyanensis LgCL085 reads mapped to its assembly, and L. guyanensis M4147, L. lainsoni M6426, L. naiffi M5533, L. panamensis WR120 and L. shawi M8408 reads mapped to L. braziliensis M2904. Dashed lines indicate disomic, trisomic and tetrasomic states. Results for L. panamensis PSC-1 and L. peruviana PAB-4377 were previously published and are in Figure S3.

The somy patterns were supported by the results of mapping the reads of each sample to their own assembled genome or to the M2904 reference to produce the read depth allele frequency (RDAF) distributions from heterozygous SNPs. The majority of L. braziliensis M2904 control chromosomes had peaks with modes at ~33% and ~67% indicating trisomy, rather than a single peak at ~50% consistent with disomy (Figure S5). The RDAF distributions from reads mapped to its own assembly for L. guyanensis LgCL085 and L. naiffi LnCL223 had a mode of ~50% (Figure S6), including peaks indicating trisomy for LgCL085 chromosomes 13, 26 and 35 (Figure S7).

8,262 L. naiffi and 8,376 L. guyanensis genes annotated

A total of 8,262 genes were annotated on L. naiffi LnCL223: of these 8,104 were protein coding genes, 78 were tRNAs, 15 rRNA genes, four snoRNA genes, two snRNA genes, and 59 pseudogenes. 310 genes were on unassigned contigs (Table S3). 8,376 genes were annotated on L. guyanensis LgCL085: of these 8,230 were protein coding genes, 75 tRNAs, 14 rRNA genes, four snoRNA genes, two snRNA genes and 51 pseudogenes. 619 genes were on unassigned contigs

There were 8,161 genes (8,001 protein coding) transferred to the control L. braziliensis genome, along with 76 tRNAs, two snRNA genes, four snoRNA genes, 13 rRNA genes and 65 pseudogenes (Table 2). 7,719 of the protein coding genes (96.5%) clustered into 7,244 OGs, whereas 8,137 of the 8,375 (97.2%) protein coding genes on the L. braziliensis reference grouped into 7,383 OGs. This indicated that 97% of protein coding genes in OGs were recovered, and only 2.8% (235) across 201 OGs were absent in the M2904 control, mainly hypothetical or encoded ribosomal proteins (Table S5). In the same way, we found 70 protein coding genes (Table S6) in 62 OGs on the M2904 control absent in the published L. braziliensis annotation.

Few genes were present in L. braziliensis but absent in L. guyanensis LgCL085 and L. naiffi LnCL223. Coverage depth was used to predict each gene’s haploid copy number such that genes with haploid copy numbers at least twice the assembled copy number indicated partially assembled genes in the reference assembly. Thus, we investigated all OGs with haploid copy numbers at least twice the assembled copy number to quantity completeness of the assembly. Only 145 genes in 92 OGs on L. guyanensis LgCL085 (Table S7), 142 genes in 90 OGs on L. naiffi LnCL223 (Table S8) and 102 genes in 71 OGs (Table S9) on the L. braziliensis control met this criterion, indicating few unassembled genes in each assembly. One hypothetical gene (LnCL223_272760) in L. naiffi LnCL223 with no retrievable information had a haploid copy number of 15 (OG5_173495), whereas all other genomes examined here had zero to two copies.

A 245 Kb rearrangement akin to a minichromosome in L. shawi M8408

We discovered a putative minichromosome or amplification at the 3’ end of L. shawi M8408 chromosome 34 based on elevated coverage across a pair of inverted repeats spanning 245 Kb (Figure 3A). This locus spanned at least bases 1,840,001 to 1,936,232 (the end) in the L. braziliensis M2904 reference (Figure S8, Table S10). It was orthologous to a known 100 Kb amplification on L. panamensis PSC-1 chromosome 34 that was predicted to produce a minichromosome when amplified, and contained the frequently amplified LD1 (Leishmania DNA 1) region [66]. In contrast to the L. panamensis PSC-1 minichromosome, the L. shawi M8408 amplification was ~30 Kb longer and closer in length to the L. braziliensis M2903 245 Kb minichromosome [67].

Figure 3A:
  • Download figure
  • Open in new tab
Figure 3A: Median coverage (blue) in 10 Kb blocks for reads mapped to L. braziliensis M2904 chromosome 34 for nine Viannia isolates. The black horizontal line is the median chromosome 34 coverage. L. panamensis PSC-1 (first plot shown) and L. shawi M8408 (second) showed a 3’ jump in coverage (green) consistent with an amplification of inverted repeats that could form a linear minichromosome. In addition, this pair shared a 45 Kb amplification (pink) also found in the L. braziliensis M2904 control (third plot shown), L. naiffi M5533 (fourth), L. panamensis WR120 (fifth), L. peruviana LEM1537 (sixth), L. peruviana PAB-4377 (seventh) and L. guyanensis M4147 (eighth). This was absent in L. lainsoni M6426 (ninth).

A 45 Kb locus was amplified in most Viannia genomes

A 45 Kb amplification on chromosome 34 spanning a gene encoding a structural maintenance of chromosome (SMC) family protein and ten hypothetical genes had between two and four copies in all samples except L. lainsoni M6426 (Figure 3A, Table S10). Using the L. guyanensis gene annotation, putative functions were assigned to five of the ten hypothetical genes. This duplication spanned chromosomal location 1.32-1.35 Mb in the L. braziliensis M2904 reference and had two additional hypothetical genes in L. naiffi LnCL223 (LnCL223_343280 and LnCL223_343290, Figure 3B).

3B:
  • Download figure
  • Open in new tab
3B: Median coverage (blue) in 10 Kb blocks for L. guyanensis LgCL085 reads mapped to its own assembled chromosome 34 (top) and L. naiffi LnCL223 reads mapped to its own assembled chromosome 34 (bottom). The black horizontal line is the median chromosome 34 coverage. There was no evidence of a 3’ amplification, but the 45 Kb amplification (pink) spanned 44,791 bases in L. naiffi LnCL223 with four copies (at chr34:1,206,328-1,251,119) and 44,123 bases in L. guyanensis LgCL085 with three copies (at chr34: 1,195,232-1,239,355). The amplification had two additional hypothetical genes in L. naiffi LnCL223.

Genes exclusive to Viannia genomes

7,961 (96.7%) of the 8,230 genes annotated for L. guyanensis LgCL085 were in 7,381 OGs, 7,893 (97.4%) of the 8,104 L. naiffi LnCL223 genes in 7,324 OGs, and 7,692 (99.3%) of the L. panamensis PSC-1 7,748 in 7,245 OGs. A total of 6,835 of these OGs were shared with nine species from the Leishmania, Sauroleishmania and Viannia subgenera: L. (L.) major, L. (L.) mexicana, L. (L.) donovani (infantum), L. (V.) guyanensis, L. (V.) naiffi, L. (V.) braziliensis, L. (V.)panamensis, L. (S.) adleri, L. (S.) tarentolae (Table S11).

We identified 22 OGs exclusive to Viannia (Table S12): three OGs contained the RNAi pathway genes (DCL1, DCL2, RIF4). Another OG was the telomere-associated mobile elements (TATE) DNA transposons (OG5_132061), a dynamic feature of Viannia genomes [51] (Supplementary Results). Four OGs encoded a diacylglycerol kinase-like protein (OG5_133291), a nucleoside transporter (OG5_134097), a beta tubulin / amastin (oG5_183241), and a /zinc transporter (OG5_214682). The remaining 14 OGs contained hypothetical genes.

A NADH-dependent fumarate reductase gene (0G5_128620) was amplified in the Viannia examined here: L. guyanensis LgCL085 had 14 copies, L. naiffi LnCL223 had 16, L. panamensis PSC-1 had 16, L. peruviana PAB4377 had 23, L. peruviana LEM1537 had 14, and braziliensis M2904 had 12. This contrasted with the Leishmania and Sauroleishmania subgenera for which three to four copies had been reported for L. infantum, L. mexicana, L. major, L. adleri and L. tarentolae [68, 69]. This gene has been implicated in enabling parasites to resist oxidative stress and potentially aiding persistence, drug resistance and metastasis [70, 71].

Few species-specific genes in L. guyanensis LgCL085 and L. naiffi LnCL223

Four genes from four OGs unique to L. naiffi LnCL223 were identified compared to other Leishmania (Table S13). Of these four, hypothetical genes LnCL223_312570 and LnCL223_292920 had orthologs in T. brucei and T. vivax, respectively. The LnCL223_341350 protein product had 44-45% sequence identity with a Leptomonas transferase family protein, and LnCL223_352070 was a methylenetetrahydrofolate reductase (OG5_128744), but had no orthologs in the other eight Leishmania or five Trypanosoma species investigated here. L. guyanensis LgCL085 had 31 unique genes in 30 OGs, 25 of which were on unplaced contigs. Four of the six chromosomal genes were also in Trypanosoma genomes, encoding two hypothetical proteins (a tuzin and a poly ADP-ribose glycohydrolase). 28 of the 31 had orthologs in eukaryotes, of which three had orthologs in the free-living freshwater ciliate protozoan Tetrahymena thermophile (Table S14) [72].

L. guyanensis LgCL085 and L. naiffi LnCL223 had over 300 gene arrays

Gene arrays are genes in the same OG with more than two haploid gene copies: they can be cis or trans. There were 327 gene arrays on L. naiffi LnCL223 (Table S15), 334 on L. guyanensis LgCL085 (Table S16) and 255 on the control L. braziliensis M2904 (Table S17) - half the arrays on each genome had two copies of each gene. 22 of the L. guyanensis, 18 of the L. naiffi LnCL223 and 15 of the control L. braziliensis gene arrays contained 10+ haploid gene copies (Table 3). The L. panamensis PSC-1 genome had ~400 tandem arrays, of which 71% had more than two copies. The L. braziliensis M2904 genome had 615 arrays corresponding to 763 OGs in OrthoMCL v5. Thus, the control genome underestimated the number of gene arrays due to either gene absence or incomplete assembly, indicating that the number of arrays on L. naiffi LnCL223 and L. guyanensis LgCL085 was underestimated.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 3:

Arrays with ten or more gene copies predicted by read depth for each species. OG stands for orthologous group. B stands for the L. braziliensis M2904 control, G for L. guyanensis LgCL085, and N for L. naiffi LnCL223.

The most expanded array on L. guyanensis LgCL085 contained TATE DNA transposons (OG5_132061) with 50 haploid gene copies (Table 3) compared with 11 on L. naiffi LnCL223, 21 on the L. braziliensis control and 16 on L. panamensis PSC-1. The L. braziliensis M2904 assembly had 40 TATE DNA transposons, but only two were annotated on the control here, illustrating that more accurate estimates of copy number may be possible.

L. naiffi LnCL223 had the highest haploid gene copy number of the leishmanolysin (GP63) array (OG5_126749) with 56 haploid gene copies, compared to 33 in L. guyanensis LgCL085, 28 in L. panamensis PSC-1 and 31 in L. braziliensis M2904. This family was not expanded in L. peruviana LEM1537 or PAB4377. This was consistent with previous work on L. guyanensis leishmanolysin [73] indicating it is a highly expressed virulence factor in promastigotes [74] affecting the survival during the initial stages of infection [74-77]. Sauroleishmania genomes also had high array copy numbers: 37 for L. adleri [69] and 84 for L. tarentolae (Table S12). Leishmania subgenus genomes had lower copy numbers, with 13 for L. mexicana, 15 for L. infantum and five for L. major (OG4_10176 for L. braziliensis M2904, L. mexicana, L. infantum and L. major).

A tuzin gene array (OG5_173452) had higher haploid copy numbers on L. guyanensis LgCL085 (19) and L. panamensis PSC-1 (22) compared with the two copies in L. naiffi, L. mexicana, L. infantum, L. major, L. braziliensis, L. adleri and L. tarentolae. Tuzins are conserved transmembrane proteins in Trypanosoma and Leishmania associated with surface glycoprotein expression [78]. They are often contiguous with δ-amastin genes, whose products are abundant cell surface transmembrane glycoproteins potentially involved in the infection or survival within macrophages. They are absent in Crithidia and Leptomonas species, who lack a vertebrate host stage [78]. Tuzins may play a role in pathogenesis [79], which may be related to MCL caused by L. guyanensis.

Discussion

L. guyanensis and L. naiffi draft reference genomes

We assembled high-quality reference genomes for two isolates, L. (Viannia) guyanensis LgCL085 and L. (Viannia) naiffi LnCL223, from short read sequence libraries to illuminate genomic diversity in the Viannia subgenus and extend previous work [52]. This process combined the de novo assembly with a reference-guided approach using the published genome of L. braziliensis M2904 to assemble the L. guyanensis LgCL085 and L. naiffi LnCL223 into 35 chromosomes each (Table 2). An essential feature of this process was to identify and remove contamination in the L. guyanensis and L. braziliensis M2904 libraries and to trim low-quality bases in L. naiffi LnCL223 to ensure that the reads used were informative and free of exogenous impurities. A second screen for contamination in unassigned contigs also removed several L. guyanensis LgCL085 contigs, which improved subsequent annotation and gene copy number estimates.

Genomes assembled from short reads capture aneuploidy and nearly all genes

Our strategy was tested by applying the same protocol to the L. braziliensis M2904 short read library, which acted as a positive control and quantified the precision of the final output. This facilitated the detection of structural variation or annotation problems: underestimated copy numbers at certain genes and the incorrect assembly of some loci that were fixed manually. The resulting genomes were largely complete: for comparison, the control L. braziliensis M2904 genome had only four homozygous SNPs, 97.2% of the protein coding genes of the reference (231 were missing) and 70 additional genes missed in the reference sequence.

We showed that the majority of Viannia were diploid and had 35 chromosomes. Aneuploidy was evident for L. guyanensis LgCL085, L. guyanensis M4147, L. naiffi LnCL223, L. naiffi M5533, L. lainsoni M6426, L. panamensis WR120 and L. shawi M8408 as anticipated [80]. This was verified using read depth allele frequency distributions of reads mapped to L. braziliensis M2904 and to their own assemblies.

The L. guyanensis LgCL085 genome had more protein coding genes (8,230) than L. naiffi LnCL223 (8,104), and both rates were like those of L. panamensis PSC-1 (7,748) [51] and L. braziliensis M2904 (8,357) [48]. The vast majority of protein coding gene models were computationally transferred [81] from the L. braziliensis M2904 reference with perfect matching, which was verified and improved manually. Both the L. guyanensis and L. naiffi references contained unassigned bin contigs, and chromosomal regions homologous to multiple chromosomal loci or containing partially collapsed gene arrays. 90 (L. naiffi) and 92 (L. guyanensis) collapsed arrays were identified where haploid gene copy numbers were at least twice the assembled copy number when the reads were mapped to the assembled gene arrays.

A better resolution of the Viannia species complexes

This study illustrated that high-throughput sequencing approaches, alignment methods and annotation tools can improve the accuracy of Leishmania gene copy number estimates, gene model details, and genome structure resolution. This yielded insights into features differentiating the isolates examined here, including a variable-length 45 Kb duplication on chromosome 34 of most Viannia, variable gene repertoires across Viannia species, and a potential minichromosome derived from the 3’ end of L. shawi M8408 chromosome 34. Further work is required to investigate L. utingensis and L. lindenbergi as well as other potential distinct lineages [82]. Longer DNA reads would also resolve repetitive regions and gene arrays more accurately.

Both single-gene and large-scale copy number variations (CNVs) were tolerated by all Leishmania genomes. Leishmania genomes have extensive conservation of gene content with few species-specific genes [45, 48]: here, only 31 L. guyanensis LgCL085 and four L. naiffi LnCL223 species-specific genes were found. These four genes unique to L. naiffi LnCL223, its leishmanolysin hyper-amplification, the 31 genes only in L. guyanensis LgCL085 and its tuzin arrays all represent potential targets for improving species-specific typing and better disease surveillance. This is important because infections by the Viannia are spread by many hosts and all sources of infections need to be addressed. Immunological screening of anti-Leishmania antibodies could be enhanced by genetic testing to identify infections from non-endemic or rarer sources like L. naiffi, which has longer parasite survival rates in macrophages in vitro [83].

MLSA of 100 Viannia isolates across four genes and genome-wide diversity inferred from mapped reads indicated that L. guyanensis LgCL085 was closest to L. panamensis PSC-1 within the L. guyanensis species complex, but was assigned the L. guyanensis classification because L. guyanensis, L. panamensis and L. shawi were a monophyletic species complex as shown by MLSA [56], MLMT [64], hsp70 [65], internal transcribed spacer (ITS) [84, 85], MLEE [86] and RAPD data [87]. Further typing of a more extensive L. guyanensis, L. panamensis and L. shawi isolate set might clarify if these are distinct species or a single genetic group.

Conclusion

This study highlighted the utility of genome sequencing for the identification, characterisation and comparison of Leishmania species. We demonstrated that short reads were sufficient for assembly of most Leishmania genomes so that SNP, chromosome copy number, structural and somy changes can be investigated comprehensively. The L. naiffi and L. guyanensis genomes represent a further advance in refining the taxonomical complexity of the Viannia and linking that to nuanced pathologies. Future work could tackle transmission, drug resistance and pathogenesis in the Viannia by applying long-read high-throughput sequencing to examine broader sets of isolates, their genetic diversity, contributions to microbiome variation, and control of transcriptional dosage at gene amplifications.

Methods

L. guyanensis and L. naiffi whole genome sequencing

Extracted DNA for L. guyanensis LgCL085 and L. naiffi LnCL223 was received from Charité University Medicine (Berlin) at the Wellcome Trust Sanger Institute on 6th Feb 2012. Paired-end 100 bp read Illumina HiSeq 2000 libraries were prepared for both during which L. guyanensis required 12 cycles of PCR. The DNA was sequenced (run 7841_5#12) on the 15th (L. guyanensis, run 7841_5#12) and 23rd (L. naiffi, run 7909_7#9) March 2012. The library preparation, sequencing and read quality verification was conducted as outlined previously [69]. The resulting L. guyanensis library contained 15,272,969 reads with a median insert size of 327.0 (NCBI accession ERX180458) and the L. naiffi one had 8,131,246 reads with a median insert size of 335.4 (ERX180449).

Viannia comparative genome, annotation and proteome files

The L. braziliensis reference genome (MHOM/BR/1975/M2904) was a positive control whose short reads were examined using the same methods. It was originally sequenced using an Illumina Genome Analyzer II [48] yielding 26,007,384 76 bp paired-end reads with a median insert size of 244.1 bp (ERX005631). Protein sequences were retrieved from the EMBL files using Artemis [88]. Two L. panamensis genomes, two L. peruviana genome assemblies and five 100 bp paired-end Illumina HiSeq 2000 read libraries of other Viannia isolates [53] were used for comparison (Table 1). We included the genomes of L. panamensis MHOM/PA/1994/PSC-1, L. peruviana PAB-4377 and LEM1537 (MHOM/PE/1984/LC39), and the 100 bp Illumina HiSeq 2000 paired-end reads for each L. peruviana PAB-4377 (16,117,316 reads) and L. peruviana LEM1537 (9,378,317 reads).

Library quality control, contaminant removal and screening

Figure S9 presents an overview of the bioinformatic steps used in this paper. Quality control of the L. guyanensis LgCL085, L. naiffi LnCL223, L. braziliensis M2904, the five Viannia libraries from [53], two L. peruviana libraries and L. panamensis PSC-1 read library was carried out using FastQC (www.bioinformatics.babraham.ac.uk/projects/fastqc/). No corrections were required for the other libraries. An abnormal distribution of GC content per read observed as an extra GC content peak outside the normal peak for the L. braziliensis M2904 and L. guyanensis reads indicated sequence contamination that was removed (Figure S10). Two Illumina PCR primers in the L. braziliensis M2904 reads were removed (Table S1). Further evaluation using GC content filtering and the non-redundant nucleotide database with BLASTn [89] to remove contaminant sequences (Figure S10) with subsequent correction of read pairing arrangements reduced the initial 52,014,768 reads to 34,592,618 properly paired reads for assembly.

The M2904 reads used to assemble a control genome were used for read mapping, error correction and SNP calling and so the contamination did not affect the published reference. However, it did reduce the number of reads mapped as shown in [48] where only 84% of the L. braziliensis M2904 short reads mapped to the L. braziliensis assembly, compared with 92% of reads for L. infantum reads mapped to its own assembly, 93% of L. major reads mapped to its own assembly, and 97% of L. mexicana reads mapped to its own assembly.

The 8,131,246 100 bp paired-end L. naiffi LnCL223 reads and 15,272,969 100 bp paired-end L. guyanensis LgCL085 reads were filtered (Table S1) in the same manner using BLASTn and the smoothness of the GC content distribution to remove putative contaminants. Low quality bases were trimmed at the 3’ end of L. naiffi LnCL223 reads to remove bases with a phred base quality < 30 using Trimmomatic [90] (Table S1, Figure S11). This resulted in 13,033,846 paired-end L. guyanensis LgCL085 sequences and 6,989,814 paired-end L. naiffi LnCL223 sequences — 85% and 86% of the initial reads, respectively (Table S1).

Genome evaluation, assembly and optimisation

Processed reads were assembled into contigs using Velvet v1.2.09 and assemblies for all odd numbered k-mer lengths from 21 to 75 were evaluated. The expected k-mer coverage was determined for each assembly using the mode of a k-mer coverage histogram from the velvet-estimate-exp_cov.pl script in Velvet to maximise resolution of repetitive and unique regions [57]. This suggested optimal k-mers of 61 for L. guyanensis LgCL085 and 43 for both L. naiffi LnCL223 and L. braziliensis, which produced assemblies with the highest N50 lengths. Each assembly was assembled with this expected coverage, and contigs were removed if their average k-mer coverage was less than half the expected coverage levels. An expected coverage of 16 and a coverage cutoff of 8 was applied to L. naiffi reads, an expected coverage of 19 and coverage cutoff of 8.5 to L. guyanensis LgCL085, and an expected coverage of 28 and coverage cutoff of 14 to L. braziliensis.

The assembly with the highest N50 for each was scaffolded using SSPACE [58]. In the initial assemblies, 76% of gaps in scaffolds (3,592/4,754) were closed in for L. guyanensis LgCL085, 63% (4,096/6,530) for L. naiffi LnCL223, and 67% (4,834/8,786) for L. braziliensis using Gapfiller [58]. Erroneous bases were corrected by mapping reads to the references with iCORN [91] (Figure S12). Misassemblies detected and broken using REAPR [60] were aligned to the L. braziliensis M2904 reference (excluding the bin chromosome 00). Scaffolds were evaluated and broken at putative misassemblies detected from the fragment coverage distribution (FCD) error and regions with low coverage when the reads were mapped to both broken and unbroken options. Additionally, the L. braziliensis broken and unbroken scaffolds were used to verify that removing misassemblies prior to (but not after) the contiguation of scaffolds resulted in more accurate assembled chromosomes. Mis-assembled regions without a gap were replaced with N bases. REAPR corrected 444 errors in L. naiffi LnCL223, of which 59 were caused by low fragment coverage, 206 in L. guyanensis LgCL085 (eight deu to low fragment coverage), and 232 in the L. braziliensis control (57 caused by low fragment coverage). Each assembly step improved the corrected N50 and percentage of error free bases (EFB%) assessed using REAPR (Table S18), with the sole exception of L. braziliensis control at the error-correction stage, likely due to its higher heterozygosity. The EFB% was the fraction of the total bases whose reads had no mismatches, matches the expected insert length, had a small FCD error and at least five read pairs oriented in the expected direction.

Gaps > 100 bp were reduced to 100 bp. 200 bp at the edge of each unplaced scaffold was aligned with the 200 bp flanking all pseudo-chromosome gaps using BLASTn to verify that no further gaps could be closed using unplaced scaffolds. Unplaced bin scaffolds < 1 Kb were discarded, and the resulting assemblies were visualised and compared to L. braziliensis using the Artemis Comparison Tool (ACT) [92]. L. guyanensis LgCL085 bin sequences with BLASTn E-values < 1e-05 and percentage identities > 40% to non-Leishmania species in nonredundant nucleotide database were removed as possible contaminants. The final scaffolds were contiguated using the L. braziliensis reference with ABACAS [59], unincorporated segments were labelled as unassigned “bin” contigs, and kDNA contigs were annotated as well (Supplementary Methods).

Phylogenomic MLSA characterisation

A MLSA (multi-locus sequence analysis) approach was adopted to verify the Leishmania species identity using for four housekeeping genes: glucose-6-phosphate dehydrogenase (G6PD), 6-phosphogluconate dehydrogenase (6PGD), mannose phosphate isomerase (MPI) and isocitrate dehydrogenase (ICD). Orthologs from other genomes and assemblies were obtained using BLASTn alignment with thresholds of E-value < 0.05 and percentage identity > 70%. L. peruviana LEM-1537 genome had gaps at the MPI and 6PGD genes and was excluded. The four housekeeping genes spanning 2,902 sites were concatenated in the order G6PD, 6PGD, MPI and ICD, and aligned using Clustal Omega v1.1 to create a Neighbour-Net network of uncorrected p-distances using SplitsTree v4.13.1.

Genome annotation and manual curation

Annotation of the L. guyanensis LgCL085, L. naiffi LnCL223 and L. braziliensis control genomes was completed using Companion [80] using L. braziliensis M2904 as the reference as outlined previously [69], including manal checking and correction of gene models. A control run with the L. braziliensis M2904 reference genome using itself as a reference was performed. In L. naiffi LnCL223, 13 genes and one pseudogene were removed because they overlapped existing superior gene models that had improved sequence identity with L. braziliensis M2904 orthologs. 46 of the protein coding genes were also manually added. 34 of the protein coding genes on L. guyanensis LgCL085 were manually added and one protein coding gene was removed. 269 gene models on L. naiffi LnCL223 and 198 on L. guyanensis with multiple joins mainly caused by the presence of short gaps were corrected by extending the gene model across the gap where the gap length was known (< 100 bp). If the gap length was unknown (> 100 bp), the gene was extended to the nearest start or stop codon.

Measuring ploidy, chromosome copy numbers and CNVs

By mapping the reads with SMALT v5.7 (www.sanger.ac.uk/resources/software/smalt/) to L. braziliensis M2904, the coverage at each site was determined to quantify the chromosome copy numbers and RDAF distributions at heterozygous SNPs as per previous work [69]. The RDAF distribution was based on the coverage level of each allele at heterozygous SNPs and this feature differed across chromosomes for each isolate (Supplementary Results). The median coverage per chromosome was obtained, and the median of the 35 values combined with the RDAF distribution mode approximating 50% indicated that all isolates examined here were mostly diploid (except the triploid L. braziliensis M2904). These were visualised with R packages ggplot2 and gridExtra.

After PCR duplicate removal, the mapped reads were used to detect CNVs across genes or within non-overlapping 10 Kb blocks for all chromosomes and bin contigs using the median depth values normalised by the median of the chromosome (or bin contig). Loci with a copy number > 2 were analysed for L. naiffi LnCL223, L. guyanensis LgCL085 and the L. braziliensis control using their reads mapped to their own assembly. This was also repeated for reads mapped to the L. braziliensis M2904 reference for L. guyanensis M4147, L. naiffi M5533, L. shawi M8408, L. lainsoni M6426, L. panamensis WR120, L. panamensis PSC-1, L. peruviana LEM1537 and L. peruviana PAB-4377. L. panamensis PSC-1 reads were mapped to its own reference genome to verify that we could find previously identified amplified loci, and we mapped L. panamensis WR120 to it so that CNVs shared by both L. panamensis could be obtained. The BAM files of L. naiffi LnCL223, L. guyanensis LgCL085 and L. braziliensis M2904 reads mapped to its own assembly were visualised in Artemis to confirm and refine the boundaries of amplified loci.

Identification of orthologous groups and gene arrays

Protein coding genes from L. guyanensis LgCL085, L. naiffi LnCL223 and the L. braziliensis M2904 control genome were produced from the EMBL files for each genome and these were submitted to the ORTHOMCLdb v5 webserver [93] to identify orthologous groups (OGs). 11,825 OGs with associated gene IDs in at least one of four Leishmania species (L. major strain Friedlin, L. infantum, L. braziliensis and L. mexicana) or five Trypanosoma species (T. vivax, T. brucei, T. brucei gambiense, T. cruzi strain CL Brener and T. congolense) were retrieved from the OrthoMCL database and compared with OGs for each genome. The copy number of each OG was estimated by summing the haploid copy number of each gene in the OG. Gene arrays in each genome were identified by finding all OGs with haploid copy number ≥ 2. Large arrays (≥ 10 gene copies) were examined and arrays with unassembled gene copies were identified by finding those with haploid gene copy number at least twice the assembled gene number.

SNP screening and detection

The filtered reads with Smalt as described mapped above were used for calling SNPs using Samtools Pileup v0.1.11 and Mpileup v0.1.18 and quality-filtered with Vcftools v0.1.12b and Bcftools v0.1.17-dev as previously [69] such that SNPs called by both Pileup and Mpileup post-screening were considered valid. These SNPs all had: base quality >25; mapping quality >30; SNP quality >30; a non-reference RDAF >0.1; forward-reverse read coverage ratios >0.1 and <0.9; five or more reads; 2+ forward reads, and 2+ reverse reads. Low quality and repetitive regions of the assemblies were identified and variants in these regions were masked as outlined elsewhere [69]. SNPs were classed as homozygous for an alternative allele to the reference if their RDAF ≥0.85 and heterozygous if it was > 0.1 and < 0.85.

The high level of nucleotide accuracy of the assembled genomes was indicated by the low rate of homozygous SNPs when the reads mapped to its own assembly (50 for L. naiffi LnCL223, 12 for L. guyanensis LgCL085, 68 for the L. braziliensis reference, and four for the L. braziliensis control). Likewise, the numbers and alleles of heterozygous SNPs for the L. braziliensis control (25,474) matched that for the reference (25,975), suggesting that the 705 (L. naiffi LnCL223) and 14,739 (L. guyanensis LgCL085) heterozygous SNPs were accurate. The difference in homozygous and heterozygous SNP rates for L. braziliensis here versus the original 2011 study [48] were likely due to differing methodology. The genetic divergence of L. naiffi LnCL223 and L. guyanensis LgCL085 compared to L. braziliensis was quantified using the density of heterozygous and homozygous SNPs per 10 Kb non-overlapping window on each chromosome, visualised using Bedtools.

Availability of materials and data

The BioProject ID is PRJEB20208 for L. guyanensis LgCL085 and PRJEB20209 for L. naiffi LnCL223. The DNA reads are available at the NCBI Short Read Archive (SRA) and and European Nucleotide Archive at ERX180458 for L. guyanensis LgCL085 and ERX180449 for L. naiffi LnCL223 (these are associated with BioProject PRJEB2600). The consensus genome sequence FASTA files are on Figshare at 10.6084/m9.figshare.5693290 for L. guyanensis LgCL085 and 10.6084/m9.figshare.5693272 for L. naiffi LnCL223. The chromosome and bin contig annotation EMBL files are at 10.6084/m9.figshare.5693284 for L. guyanensis LgCL085 and 10.6084/m9.figshare.5693278 for L. naiffi LnCL223. The Supplementary Tables are on Figshare at 10.6084/m9.figshare.5697064.

Funding

The authors acknowledge financial support from the NUI Galway Ph.D. Fellowship scheme (S.C.) and the Wellcome Trust core funding of the Wellcome Trust Sanger Institute (WTSI, grant 098051) (J.A.C. and M.S.).

Contributions

S.C. completed the genome assembly, comparative genomics, phylogenetic analysis, mutation investigation, helped design the study and wrote the main manuscript text. S.C., A.S.T. and E. F. completed the genome annotation. M.S. completed genome sequencing. G.S. helped design the study and wrote the main manuscript text. J.A.C. helped design the study and wrote the main manuscript text. T.D. co-ordinated and designed the study and wrote the main manuscript text. All authors gave approval for publication.

Competing interests

The authors have no competing interests.

Acknowledgements

The authors thank: Matthew Berriman and members of the WTSI DNA pipelines team for generating the two sequence libraries; Elisa Cupolillo (Instituto Oswaldo Cruz, Brazil) for discussions and comments on the manuscript; Katrin Kuhls (Technical University of Applied Sciences Wildau), Cathal Seoighe (NUI Galway), Hideo Imamura and Jean-Claude Dujardin (both Institute of Tropical Medicine Antwerp) for help; Anne Stone and Kelly Harkins (both Arizona State University) for releasing valuable sequence read data; and the DJEI/DES/SFI/HEA Irish Centre for High-End Computing (ICHEC) for computational facilities.

Footnotes

  • Email addresses: Simone Coughlan: coughls{at}gmail.com, Ali Taylor: ali.taylor7{at}mail.dcu.ie, Eoghan Feane: eoghan.feane2{at}mail.dcu.ie, Mandy Sanders: mjs{at}sanger.ac.uk, Gabriele Schonian: gabriele.schoenian{at}t-online.de, James A. Cotton: jc17{at}sanger.ac.uk, Tim Downing: tim.downing{at}dcu.ie

References

  1. 1.↵
    Pan American Health Organization: Leishmaniases “Epidemiological Report in the Americas” 2017. Available at http://www2.paho.org/hq/index.php?option=com_docman&task=doc_view&Itemid=270&gid=39646&lang=en
  2. 2.↵
    World Health Organization 2010 Control of the leishmaniases. World Health Organ. Tech. Rep. Ser.
  3. 3.↵
    Gramiccia M and Gradoni L 2005 The current status of zoonotic leishmaniases and approaches to disease control. Int. J. Parasitol. 35 1169–80
    OpenUrlCrossRefPubMedWeb of Science
  4. 4.↵
    Killick-Kendrick R 1999 The biology and control of Phlebotomine sand flies Clin. Dermatol. 17 279–89
    OpenUrl
  5. 5.↵
    Maroli M, M.D. F, Bichaud L, Charrel R N and Gradoni L 2013 Phlebotomine sandflies and the spreading of leishmaniases and other diseases of public health concern Med. Vet. Entomol. 27 123–47
    OpenUrl
  6. 6.
    Walsh J F, Molyneux D H and Birley M H 1993 Deforestation: effects on vectorborne disease. Parasitology 106 Suppl S55–75
    OpenUrlCrossRefPubMedWeb of Science
  7. 7.↵
    Davies C R, Campbell-lendrum D, Reithinger R, Campbell-lendrum D, Feliciangeli D, Borges R and Rodriguez N 2000 The epidemiology and control of leishmaniasis in Andean countries Epidemiologia e controle da leishmaniose nos países andinos Cad. Saude Pública, Rio Janeiro 16 925–50
    OpenUrl
  8. 8.↵
    Rotureau B. Are New World leishmaniases becoming anthroponoses? Med Hypotheses. 2006 67(5):1235–41.
    OpenUrlCrossRefPubMedWeb of Science
  9. 9.↵
    Lainson R and Shaw J J 1989 Leishmania (Viannia) naiffi sp. n., a parasite of the armadillo, Dasypus novemcinctus (L.) in Amazonian Brazil. Ann. Parasitol. Hum. Comp. 64 3–9
    OpenUrlPubMedWeb of Science
  10. 10.↵
    Naiff R D, Freitas R A, Naiff M F, Arias J R, Barrett T V., Momen H and Grimaldi Júnior G 1991 Epidemiological and nosological aspects of Leishmania naiffi Lainson & Shaw, 1989. Mem. Inst. Oswaldo Cruz 86 317–21
    OpenUrlCrossRefPubMedWeb of Science
  11. 11.↵
    Roque A L R and Jansen A M 2014 Wild and synanthropic reservoirs of Leishmania species in the Americas Int. J. Parasitol. Parasites Wildl. 3 251–62
    OpenUrl
  12. 12.↵
    de Souza AAA, da Rocha Barata I, das Graças Soares Silva M, Lima JAN, Jennings YLL, Ishikawa EAY, Prévot G, Ginouves M, Silveira FT, Shaw J, Dos Santos TV. Natural Leishmania (Viannia) infections of phlebotomines (Diptera: Psychodidae) indicate classical and alternative transmission cycles of American cutaneous leishmaniasis in the Guiana Shield, Brazil. Parasite. 2017 24:13. doi: 10.1051/parasite/2017016.
    OpenUrlCrossRef
  13. 13.↵
    Arias J R, Miles M A, Naiff R D, Povoa M M, de Freitas R A, Biancardi C B and Castellon E G 1985 Flagellate infections of Brazilian sand flies (Diptera: Psychodidae): isolation in vitro and biochemical identification of Endotrypanum and Leishmania. Am. J. Trop. Med. Hyg. 34 1098–108
    OpenUrlAbstract/FREE Full Text
  14. 14.↵
    Kato H, Gomez E A, Yamamoto Y, Calvopiña M, Guevara A G, Marco J D, Barroso P A, Iwata H and Hashiguchi Y 2008 Natural infection of Lutzomyia tortura with Leishmania (Viannia) naiffi in an Amazonian area of Ecuador. Am. J. Trop. Med. Hyg. 79 438–40
    OpenUrlAbstract/FREE Full Text
  15. 15.
    Azpurua J, De La Cruz D, Valderama A and Windsor D 2010 Lutzomyia Sand Fly Diversity and Rates of Infection by Wolbachia and an Exotic Leishmania Species on Barro Colorado Island, Panama ed J G Valenzuela PLoS Negl. Trop. Dis. 4 e627
    OpenUrl
  16. 16.↵
    Cássia-Pires R, Boité M C, D’Andrea P S, Herrera H M, Cupolillo E, Jansen A M and Roque A L R 2014 Distinct Leishmania Species Infecting Wild Caviomorph Rodents (Rodentia: Hystricognathi) from Brazil PLoS Negl. Trop. Dis. 8
  17. 17.↵
    Abba, A. M.; Superina M 2010 The 2009/2010 armadillo red list assessment BioOne 11 135–84
    OpenUrl
  18. 18.↵
    Ober H K, Degroote L W, McDonough C M, Mizell R F and Mankin R W 2011 Identification of an attractant for the nine-banded armadillo, Dasypus novemcinctus Wildl. Soc. Bull. 35 421–9
    OpenUrl
  19. 19.↵
    van Thiel P-P A M, Gool T Van, Kager P A and Bart A 2010 First cases of cutaneous leishmaniasis caused by Leishmania (Viannia) naiffi infection in Surinam. Am. J. Trop. Med. Hyg. 82 588–90
    OpenUrlAbstract/FREE Full Text
  20. 20.↵
    Fagundes-Silva G A, Sierra Romero G A, Cupolillo E, Gadelha Yamashita E P, Gomes-Silva A, De Oliveira Guerra J A and Da-Cruz A M 2015 Leishmania (Viannia) naiffi: Rare enough to be neglected? Mem. Inst. Oswaldo Cruz 110 797–800
    OpenUrl
  21. 21.↵
    Lainson R, Shaw J J, Silveira F T, Braga R R and Ishikawa E a 1990 Cutaneous leishmaniasis of man due to Leishmania (Viannia) naiffi Lainson and Shaw, 1989. Ann. Parasitol. Hum. comparée 65 282–4
    OpenUrl
  22. 22.↵
    Pratlong F, Deniau M, Darie H, Eichenlaub S, Pröll S, Garrabe E, le Guyadec T and Dedet J P 2002 Human cutaneous leishmaniasis caused by Leishmania naiffi is widespread in South America. Ann. Trop. Med. Parasitol. 96 781–5
    OpenUrlCrossRefPubMedWeb of Science
  23. 23.↵
    Van Der Snoek E M, Lammers A M, Kortbeek L M, Roelfsema J H, Bart A and Jaspers C A J J 2009 Spontaneous cure of American cutaneous leishmaniasis due to Leishmania naiffi in two Dutch infantry soldiers Clin. Exp. Dermatol. 34
  24. 24.↵
    Floch H 1954 Leishmania tropica guyanensis n.sp. agent de la leishmaniose tegumentarie de Guyanes et de l’Amerique Centralele Arch Inst Pasteur La Guyane Française du Teritoire L’Inni 15
  25. 25.↵
    Lainson R, Shaw J J and Povoa M 1981 The importance of edentates (sloths and anteaters) as primary reservoirs of Leishmania braziliensis guyanensis, causative agent of “pianbois” in north Brazil. Trans. R. Soc. Trop. Med. Hyg. 75 611–2
    OpenUrlCrossRefPubMedWeb of Science
  26. 26.↵
    Arias JR, Naif RD, Miles MA, de Souza AA. The opossum, Didelphis marsupialis (Marsupialia: Didelphidae), as a reservoir host of Leishmania braziliensis guyanensis in the Amazon Basin of Brazil. Trans R Soc Trop Med Hyg. 1981 75(4):537–41.
    OpenUrlCrossRefPubMedWeb of Science
  27. 27.↵
    Dedet JP, Gay F, Chatenay G. Isolation of Leishmania species from wild mammals in French Guiana. Trans R Soc Trop Med Hyg. 1989 Sep-Oct;83(5):613–5.
    OpenUrlCrossRefPubMed
  28. 28.↵
    Quaresma PF, Rêgo FD, Botelho HA, da Silva SR, Moura Júnior AJ, Teixeira Neto RG, Madeira FM, Carvalho MB, Paglia AP, Melo MN, Gontijo CM. Wild, synanthropic and domestic hosts of Leishmania in an endemic area of cutaneous leishmaniasis in Minas Gerais State, Brazil. Trans R Soc Trop Med Hyg. 2011 Oct;105(10):579–85.
    OpenUrlCrossRefPubMed
  29. 29.↵
    Lainson R, Shaw J J, Ward R D, Ready P D and Naiff R D 1979 Leishmaniasis in brazil: XIII. Isolation of leishmania from armadillos (dasypus novemcinctus), and observations on the epidemiology of cutaneous leishmaniasis in north para’ state Trans. R. Soc. Trop. Med. Hyg. 73 239–42
    OpenUrlCrossRefPubMed
  30. 30.↵
    Quinnell R J and Courtenay O 2009 Transmission, reservoir hosts and control of zoonotic visceral leishmaniasis. Parasitology 136 1915–34
    OpenUrlCrossRefPubMedWeb of Science
  31. 31.↵
    Ready P D, et al. 1986 The ecology of lutzomyia umbratilis Ward & Fraiha (Diptera: Psychodidae), the major vector to man of Leishmania braziliensis guyanensis in north-eastern Amazonian brazil Bull. Entomol. Res. 76 21
    OpenUrl
  32. 32.
    Balbino V Q, Marcondes C B, Alexander B, Luna L K S, Lucena M M M, Mendes A C S and Andrade P P 2001 First Report of Lutzomyia (Nyssomyia) umbratilis Ward & Frahia, 1977 outside of Amazonian Region, in Recife, State of Pernambuco, Brazil (Diptera: Psychodidae: Phlebotominae) Mem. Inst. Oswaldo Cruz 96 315–7
    OpenUrlPubMedWeb of Science
  33. 33.↵
    Young D G and Duncan M A 1994 Guide to the identification and geographic distribution of Lutzomyia sand flies in Mexico, the West Indies, Central and South America (Diptera: Psychodidae) vol 54
  34. 34.↵
    Rodríguez-Barraquer I, Góngora R, Prager M, Pacheco R, Montero L M, Navas A, Ferro C, Miranda M C and Saravia N G 2008 Etiologic agent of an epidemic of cutaneous leishmaniasis in Tolima, Colombia Am. J. Trop. Med. Hyg. 78 276–82
    OpenUrlAbstract/FREE Full Text
  35. 35.
    Fouque F, Gaborit P, Issaly J, Carinci R, Gantier J-C, Ravel C and Dedet J-P 2007 Phlebotomine sand flies (Diptera: Psychodidae) associated with changing patterns in the transmission of the human cutaneous leishmaniasis in French Guiana. Mem. Inst. Oswaldo Cruz 102 35–40
    OpenUrlCrossRefPubMedWeb of Science
  36. 36.
    Lainson R, Shaw J J, Ready P D, Miles M A and Póvoa M 1981 Leishmaniasis in Brazil: XVI. Isolation and identification of Leishmania species from sandflies, wild mammals and man in north Para State, with particular reference to L. braziliensis guyanensis causative agent of “pian-bois”. Trans. R. Soc. Trop. Med. Hyg. 75 530–6
    OpenUrlCrossRefPubMedWeb of Science
  37. 37.
    Van Der Meide W F, Jensema A J, Akrum R A E, Sabajo L O A, Lai A Fat R F M, Lambregts L, Schallig H D F H, Van Der Paardt M and Faber W R 2008 Epidemiology of cutaneous leishmaniasis in Suriname: A study performed in 2006 Am. J. Trop. Med. Hyg. 79 192–7
    OpenUrlAbstract/FREE Full Text
  38. 38.
    Garcia A L, Tellez T, Parrado R, Rojas E, Bermudez H and Dujardin J C 2007 Epidemiological monitoring of American tegumentary leishmaniasis: molecular characterization of a peridomestic transmission cycle in the Amazonian lowlands of Bolivia Trans. R. Soc. Trop. Med. Hyg. 101 1208–13
    OpenUrlCrossRefPubMed
  39. 39.↵
    Rotureau B, Ravel C, Nacher M, Couppié P, Curtet I, Dedet J P and Carme B 2006 Molecular epidemiology of Leishmania (Viannia) guyanensis in French Guiana J.
  40. 40.↵
    Delgado O, Cupolillo E, Bonfante-Garrido R, Silva S, Belfort E, Grimaldi Jr G and Momen H 1997 Cutaneous Leishmaniasis in Venezuela Caused by Infection with a New Hybrid between Leishmania (Viannia) braziliensis and L. (V.) guyanensis Mem. Inst. Oswaldo Cruz 92 581–2
    OpenUrlCrossRefPubMedWeb of Science
  41. 41.↵
    Bonfante-Garrido R, Meléndez E, Barroeta S, de Alejos M A, Momen H, Cupolillo E, McMahon-Pratt D and Grimaldi G 1992 Cutaneous leishmaniasis in western Venezuela caused by infection with Leishmania venezuelensis and L. braziliensis variants. Trans. R. Soc. Trop. Med. Hyg. 86 141–8
    OpenUrlCrossRefPubMed
  42. 42.↵
    Jennings Y L, de Souza AAA, Ishikawa E A, Shaw J, Lainson R and Silveira F 2014 Phenotypic characterization of Leishmania species causing cutaneous leishmaniasis in the lower Amazon region, western Pará state, Brazil, reveals a putative hybrid parasite, Leishmania (Viannia) guyanensis χ Leishmania (Viannia) shawi shawi. Parasite 21 39
    OpenUrl
  43. 43.↵
    Tojal da Silva AC, Cupolillo E, Volpini AC, Almeida R, Romero GA. Species diversity causing human cutaneous leishmaniasis in Rio Branco, state of Acre, Brazil. Trop Med Int Health. 2006 11(9):1388–98.
    OpenUrlCrossRefPubMedWeb of Science
  44. 44.↵
    Cortes S, Vaz Y, Neves R, Maia C, Cardoso L, Campino L. Risk factors for canine leishmaniasis in an endemic Mediterranean region. Vet Parasitol. 2012 189(2-4):189–96. doi: 10.1016/j.vetpar.2012.04.028.
    OpenUrlCrossRefPubMed
  45. 45.↵
    Peacock C S, et al. 2007 Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat. Genet. 39 839–47
    OpenUrlCrossRefPubMedWeb of Science
  46. 46.↵
    Martínez-Calvillo S, Yan S, Nguyen D, Fox M, Stuart K and Myler P J 2003 Transcription of Leishmania major Friedlin chromosome 1 initiates in both directions within a single region. Mol. Cell 11 1291–9
    OpenUrlCrossRefPubMedWeb of Science
  47. 47.↵
    Clayton C and Shapira M 2007 Post-transcriptional regulation of gene expression in trypanosomes and leishmanias Mol. Biochem. Parasitol. 156 93–101
    OpenUrlCrossRefPubMedWeb of Science
  48. 48.↵
    Rogers M B, Hilley J D, Dickens N J, Wilkes J, Bates P a, Depledge D P, Harris D, Her Y, Herzyk P, Imamura H, Otto T D, Sanders M, Seeger K, Dujardin J-C, Berriman M, Smith D F, Hertz-Fowler C and Mottram J C 2011 Chromosome and gene copy number variation allow major structural change between species and strains of Leishmania. Genome Res. 21 2129–42
    OpenUrlAbstract/FREE Full Text
  49. 49.↵
    Wincker P, Ravel C, Blaineau C, Pages M, Jauffret Y, Dedet J P and Bastien P 1996 The Leishmania genome comprises 36 chromosomes conserved across widely divergent human pathogenic species. Nucleic Acids Res. 24 1688–94
    OpenUrlCrossRefPubMedWeb of Science
  50. 50.↵
    Britto C, Ravel C, Bastien P, Blaineau C, Pagès M, Dedet J-P P and Wincker P 1998 Conserved linkage groups associated with large-scale chromosomal rearrangements between Old World and New World Leishmania genomes Gene 222 107–17
    OpenUrlCrossRefPubMedWeb of Science
  51. 51.↵
    Llanes A, Restrepo C M, Del Vecchio G, Anguizola F J and Lleonart R 2015 The genome of Leishmania panamensis: insights into genomics of the L. (Viannia) subgenus. Sci. Rep. 5 8550
    OpenUrlCrossRefPubMed
  52. 52.↵
    Valdivia H O, Reis-Cunha J L, Rodrigues-Luiz G F, Baptista R P, Baldeviano G C, Gerbasi R V, Dobson D E, Pratlong F, Bastien P, Lescano A G, Beverley S M and Bartholomeu D C 2015 Comparative genomic analysis of Leishmania (Viannia) peruviana and Leishmania (Viannia) braziliensis. BMC Genomics 16 715
    OpenUrl
  53. 53.↵
    Harkins K M, Schwartz R S, Cartwright R A and Stone A C 2016 Phylogenomic reconstruction supports supercontinent origins for Leishmania. Infect. Genet. Evol. 38 101–9
    OpenUrl
  54. 54.↵
    Oddone R, Schweynoch C, Schönian G, De Sousa CDS, Cupolillo E, Espinosa D, Arevalo J, Noyes H, Mauricio I and Kuhls K 2009 Development of a multilocus microsatellite typing approach for discriminating strains of Leishmania (Viannia) species J. Clin. Microbiol. 47 2818–25
    OpenUrlAbstract/FREE Full Text
  55. 55.
    Lye L-F F, Owens K, Shi H, Murta S M F, Vieira A C, Turco S J, Tschudi C, Ullu E and Beverley S M 2010 Retention and Loss of RNA interference pathways in trypanosomatid protozoans ed B Ullman PLoS Pathog. 6 e1001161
  56. 56.↵
    Boité M C, Mauricio I L, Miles M A and Cupolillo E 2012 New insights on taxonomy, phylogeny and population genetics of Leishmania (Viannia) parasites based on multilocus sequence analysis. PLoS Negl. Trop. Dis. 6 e1888.
  57. 57.↵
    Zerbino D R and Birney E 2008 Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18 821–9
    OpenUrlAbstract/FREE Full Text
  58. 58.↵
    Boetzer M, Henkel C V, Jansen H J, Butler D and Pirovano W 2011 Scaffolding pre assembled contigs using SSPACE. Bioinformatics 27 578–9
    OpenUrlCrossRefPubMedWeb of Science
  59. 59.↵
    Assefa S, Keane T M, Otto T D, Newbold C and Berriman M 2009 ABACAS: algorithm-based automatic contiguation of assembled sequences. Bioinformatics 25 1968–9
    OpenUrlCrossRefPubMedWeb of Science
  60. 60.↵
    Hunt M, Kikuchi T, Sanders M, Newbold C, Berriman M and Otto T D 2013 REAPR: a universal tool for genome assembly evaluation. Genome Biol. 14 R47
    OpenUrlCrossRefPubMed
  61. 61.↵
    Sievers F, Wilm A, Dineen D, Gibson T J, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J, Thompson J D and Higgins D G 2011 Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7 539
    OpenUrlAbstract/FREE Full Text
  62. 62.↵
    Huson D H and Bryant D 2006 Application of phylogenetic networks in evolutionary studies. Mol. Biol. Evol. 23 254–67
    OpenUrlCrossRefPubMedWeb of Science
  63. 63.↵
    Schönian G, Mauricio I and Cupolillo E 2010 Is it time to revise the nomenclature of Leishmania? vol 26
  64. 64.↵
    Kuhls K, Cupolillo E, Silva SO, Schweynoch C, Boité MC, Mello MN, Mauricio I, Miles M, Wirth T, Schönian G. Population structure and evidence for both clonality and recombination among Brazilian strains of the subgenus Leishmania (Viannia). PLoS Negl Trop Dis. 2013 7(10): e2490. doi:10.1371/journal.pntd.0002490.
    OpenUrlCrossRefPubMed
  65. 65.↵
    Fraga J, Montalvo AM, De Doncker S, Dujardin JC, Van der Auwera G (2010) Phylogeny of Leishmania species based on the heat-shock protein 70 gene. Infect Genet Evol 10: 238–245.
    OpenUrlCrossRefPubMed
  66. 66.↵
    Segovia M and Ortiz G 1997 LDI amplifications in Leishmania Parasitol. Today 13 342–8
    OpenUrl
  67. 67.↵
    Fu G, Melville S, Brewster S, Warner J and Barker D C 1998 Analysis of the genomic organisation of a small chromosome of Leishmania braziliensis M2903 reveals two genes encoding GTP-binding proteins, one of which belongs to a new G-protein family and is an antigen Gene 210 325–33
    OpenUrl
  68. 68.↵
    Raymond F, Boisvert S, Roy G, Ritt J-F, Légaré D, Isnard A, Stanke M, Olivier M, Tremblay M J, Papadopoulou B, Ouellette M and Corbeil J 2012 Genome sequencing of the lizard parasite Leishmania tarentolae reveals loss of genes associated to the intracellular stage of human pathogenic species. Nucleic Acids Res. 40 1131–47
    OpenUrlCrossRefPubMedWeb of Science
  69. 69.↵
    Coughlan S, Mulhair P, Sanders M, Schonian G, Cotton J A and Downing T 2017 The genome of Leishmania adleri from a mammalian host highlights chromosome fission in Sauroleishmania Sci. Rep. 7 43747
    OpenUrl
  70. 70.↵
    Hartley M A, Drexler S, Ronet C, Beverley S M and Fasel N 2014 The immunological, environmental, and phylogenetic perpetrators of metastatic leishmaniasis Trends Parasitol. 30 412–22
  71. 71.↵
    Acestor N, Masina S, Ives A, Walker J, Saravia N G and Fasel N 2006 Resistance to oxidative stress is associated with metastasis in mucocutaneous leishmaniasis. J. Infect. Dis. 194 1160–7
    OpenUrlCrossRefPubMedWeb of Science
  72. 72.↵
    Eisen J A, et al. 2006 Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote PLoS Biol. 4 1620–42
    OpenUrlWeb of Science
  73. 73.↵
    Steinkraus H B, Greer J M, Stephenson D C and Langer P J 1993 Sequence heterogeneity and polymorphic gene arrangements of the Leishmania guyanensis gp63 genes Mol. Biochem. Parasitol. 62 173–85
    OpenUrl
  74. 74.↵
    Joshi P B, Sacks D L, Modi G and McMaster W R 1998 Targeted gene deletion of Leishmania major genes encoding developmental stage-specific leishmanolysin (GP63) Mol. Microbiol. 27 519–30
    OpenUrl
  75. 75.
    Joshi P B, Kelly B L, Kamhawi S, Sacks D L and McMaster W R 2002 Targeted gene deletion in Leishmania major identifies leishmanolysin (GP63) as a virulence factor. Mol. Biochem. Parasitol. 120 33–40
    OpenUrlCrossRefPubMedWeb of Science
  76. 76.
    Olivier M, Atayde V D, Isnard A, Hassani K and Shio M T 2012 Leishmania virulence factors: focus on the metalloprotease GP63 Microbes Infect. 14 1377–89
  77. 77.↵
    Brittingham A, Morrison C J, McMaster W R, McGwire B S, Chang K P and Mosser D M 1995 Role of the Leishmania surface protease gp63 in complement fixation, cell adhesion, and resistance to complement-mediated lysis. J. Immunol. 155 3102–11
    OpenUrlAbstract
  78. 78.↵
    Jackson A P 2010 The evolution of amastin surface glycoproteins in trypanosomatid parasites. Mol. Biol. Evol. 27 33–45
    OpenUrlCrossRefPubMedWeb of Science
  79. 79.↵
    Lakshmi B S, Wang R and Madhubala R 2014 Leishmania genome analysis and high-throughput immunological screening identifies tuzin as a novel vaccine candidate against visceral leishmaniasis Vaccine 32 3816–22
  80. 80.↵
    Mannaert A, Downing T, Imamura H and Dujardin J-C C 2012 Adaptive mechanisms in pathogens: universal aneuploidy in Leishmania. Trends Parasitol. 28 370–6
    OpenUrlCrossRefPubMedWeb of Science
  81. 81.↵
    Steinbiss S, Silva-Franco F, Brunk B, Foth B, Hertz-Fowler C, Berriman M and Otto T D 2016 Companion: a web server for annotation and analysis of parasite genomes Nucleic Acids Res. 44 W29–34
    OpenUrlCrossRefPubMed
  82. 82.↵
    Akhoundi M, Downing T, Votýpka J, Kuhls K, Lukeš J, Cannet A, Ravel C, Marty P, Delaunay P, Kasbari M, Granouillac B, Gradoni L, Sereno D. Leishmania infections: Molecular targets and diagnosis. Mol Aspects Med. 2017 doi: 10.1016/j.mam.2016.11.012.
    OpenUrlCrossRef
  83. 83.↵
    Matta NE, Cysne-Finkelstein L, Machado GM, Da-Cruz AM, Leon L. Differences in the antigenic profile and infectivity of murine macrophages of Leishmania (Viannia) parasites. J Parasitol. 2010 Jun;96(3):509–15. doi: 10.1645/GE-2241.1.
    OpenUrlCrossRefPubMed
  84. 84.↵
    Cupolillo E, Grimaldi Júnior G, Momen H and Beverley S M 1995 Intergenic region typing (IRT): a rapid molecular approach to the characterization and evolution of Leishmania. Mol. Biochem. Parasitol. 73 145–55
    OpenUrlCrossRefPubMedWeb of Science
  85. 85.↵
    Berzunza-Cruz M, Cabrera N, Crippa-Rossi M, Sosa Cabrera T, Pérez-Montfort R and Becker I 2002 Polymorphism analysis of the internal transcribed spacer and small subunit of ribosomal RNA genes of Leishmania mexicana. Parasitol. Res. 88 918–25
    OpenUrlCrossRefPubMedWeb of Science
  86. 86.↵
    Cupolillo E, Grimaldi G and Momen H 1994 A general classification of new world Leishmania using numerical zymotaxonomy Am. J. Trop. Med. Hyg. 50 296–311
    OpenUrl
  87. 87.↵
    Bañuls A L, Jonquieres R, Guerrini F, Le Pont F, Barrera C, Espinel I, Guderian R, Echeverria R and Tibayrenc M 1999 Genetic analysis of leishmania parasites in Ecuador: are Leishmania (Viannia) panamensis and Leishmania (V.) Guyanensis distinct taxa? Am. J. Trop. Med. Hyg. 61 838–45
    OpenUrlAbstract
  88. 88.↵
    Carver T, Harris SR, Berriman M, Parkhill J, McQuillan JA. Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental data. Bioinformatics. 2012 28(4):464–9. doi: 10.1093/bioinformatics/btr703.
    OpenUrlCrossRefPubMedWeb of Science
  89. 89.↵
    Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K and Madden T L 2009 BLAST+: architecture and applications. BMC Bioinformatics 10 421
    OpenUrlCrossRefPubMed
  90. 90.↵
    Bolger A M, Lohse M and Usadel B 2014 Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics btu170-
  91. 91.↵
    Otto TD, Sanders M, Berriman M, Newbold C. Iterative Correction of Reference Nucleotides (iCORN) using second generation sequencing technology. Bioinformatics. 2010 26(14):1704–7. doi: 10.1093/bioinformatics/btq269.
    OpenUrlCrossRefPubMedWeb of Science
  92. 92.↵
    Carver T J, Rutherford K M, Berriman M, Rajandream M-A, Barrell B G and Parkhill J 2005 ACT: the Artemis Comparison Tool. Bioinformatics 21 3422–3
    OpenUrlCrossRefPubMedWeb of Science
  93. 93.↵
    Li L, Stoeckert CJ Jr., Roos DS. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003 13(9):2178–89.
    OpenUrlAbstract/FREE Full Text
Back to top
PreviousNext
Posted December 14, 2017.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene content evolution in the Viannia subgenus
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene content evolution in the Viannia subgenus
Simone Coughlan, Ali Shirley Taylor, Eoghan Feane, Mandy Sanders, Gabriele Schonian, James A. Cotton, Tim Downing
bioRxiv 233148; doi: https://doi.org/10.1101/233148
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene content evolution in the Viannia subgenus
Simone Coughlan, Ali Shirley Taylor, Eoghan Feane, Mandy Sanders, Gabriele Schonian, James A. Cotton, Tim Downing
bioRxiv 233148; doi: https://doi.org/10.1101/233148

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3480)
  • Biochemistry (7327)
  • Bioengineering (5299)
  • Bioinformatics (20206)
  • Biophysics (9983)
  • Cancer Biology (7705)
  • Cell Biology (11261)
  • Clinical Trials (138)
  • Developmental Biology (6425)
  • Ecology (9919)
  • Epidemiology (2065)
  • Evolutionary Biology (13288)
  • Genetics (9353)
  • Genomics (12558)
  • Immunology (7679)
  • Microbiology (18962)
  • Molecular Biology (7420)
  • Neuroscience (40904)
  • Paleontology (298)
  • Pathology (1226)
  • Pharmacology and Toxicology (2127)
  • Physiology (3141)
  • Plant Biology (6839)
  • Scientific Communication and Education (1270)
  • Synthetic Biology (1893)
  • Systems Biology (5299)
  • Zoology (1086)