Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

The genome sequence of Aloe vera reveals adaptive evolution of drought tolerance mechanisms

Shubham K. Jaiswal, Abhisek Chakraborty, Shruti Mahajan, Sudhir Kumar, View ORCID ProfileVineet K. Sharma
doi: https://doi.org/10.1101/2020.05.29.122895
Shubham K. Jaiswal
1Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Abhisek Chakraborty
1Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shruti Mahajan
1Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sudhir Kumar
1Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vineet K. Sharma
1Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Vineet K. Sharma
  • For correspondence: vineetks@iiserb.ac.in
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

ABSTRACT

Aloe vera is a species from Asphodelaceae plant family having unique characteristics such as drought resistance and also possesses numerous medicinal properties. However, the genetic basis of these phenotypes is yet unknown, primarily due to the unavailability of its genome sequence. In this study, we report the first Aloe vera draft genome sequence comprising of 13.83 Gbp and harboring 86,177 coding genes. It is also the first genome from the Asphodelaceae plant family and is the largest angiosperm genome sequenced and assembled till date. Further, we report the first genome-wide phylogeny of monocots with Aloe vera using 1,440 one-to-one orthologs that resolves the genome-wide phylogenetic position of Aloe vera with respect to the other monocots. The comprehensive comparative analysis of Aloe vera genome with the other available high-quality monocot genomes revealed adaptive evolution in several genes of the drought stress response, CAM pathway, and circadian rhythm in Aloe vera. Further, genes involved in DNA damage response, a key pathway in several biotic and abiotic stress response mechanisms, were found to be positively selected. This provides the genetic basis of the evolution of drought stress tolerance capabilities of Aloe vera. This also substantiates the previously suggested notion that the evolution of unique characters in this species is perhaps due to selection and adaptive evolution rather than the phylogenetic divergence or isolation.

INTRODUCTION

Aloe vera is a succulent and drought-resistant plant belonging to the genus Aloe of family Asphodelaceae [1]. More than 400 species are known in genus Aloe, of which four have medicinal properties with Aloe vera being the most potent species [2]. Aloe vera is a perennial tropical plant with succulent and elongated leaves having a transparent mucilaginous tissue consisting of parenchyma cells in the center referred to as Aloe vera gel [3]. The plant is extensively used as a herb in traditional practices in several countries, and in cosmetics and skin care products due to its pharmacological properties including anti-inflammatory, anti-tumor, anti-viral, anti-ulcers, fungicidal, etc. [4, 5]. These medicinal properties emanate from the presence of numerous chemical constituents such as anthraquinones, vitamins, minerals, enzymes, sterols, amino acids, salicylic acids, and carbohydrates [6, 7]. These properties make it commercially important, with a global market worth 1.6 billion [8].

One of the key characteristics of this succulent plant is drought resistance that enables it to survive in adverse hot and dry climates [1]. The plant has thick leaves arranged in an attractive rosette pattern to the stem. As an adaptation to the hot climate, the plant is able to perform a photosynthetic pathway known as crassulacean acid metabolism (CAM) that helps in limiting the water loss by transpiration [9]. Moreover, the leaves have the capacity to store a large volume of water in their tissues [10]. It is also known to synthesize more of soluble carbohydrates to make the osmotic adjustments under the limited water conditions, thus improving the water use efficiency [11]. Though several studies have been performed on drought stress tolerance and potential benefits of Aloe vera, the unavailability of its reference genome sequence has been a deterrent in understanding the genetic basis and molecular mechanisms of the unique characteristics of this medicinal plant.

In addition to the functional analysis, the resolution of the phylogenetic position has the potential to reveal the evolutionary history, and to understand the correlations between phylogenetic diversity and important traits of interest. Multiple attempts have been made to resolve the phylogenetic position of Aloe genus and Aloe vera, however, these efforts only used a few conserved loci such as rbcL, psbA, matK, and ribosomal genes, and could not be performed at genome-wide level due to the unavailability of the genomic sequence [2, 12, 13]. The previous phylogenies have reported that Aloe vera shared the most common recent ancestor with the species of Poales and Zingiberales order, also within the Asparagales order, it was closest to the other succulent genera such as Haworthia, Gasteria, and Astroloba [14, 15].

The unavailability of the genome sequence of Aloe vera is also noteworthy given the fact that the representative genomes of species from almost all the plant families, including Brassicaceae, Cannabaceae, Cucurbitaceae, Euphorbiaceae, Fabaceae, Malvaceae, Rosaceae, Solanaceae, Poaceae, Orchidaceae, Betulaceae have been sequenced and studied. However, till date, none of the species from the Asphodelaceae plant family has been sequenced. However, an estimate of the genome size of Aloe vera is available in the Plant DNA c-value database, estimated as 16.04 Gbp with a diploid ploidy level containing 14 (2n) chromosomes [16]. Thus, the availability of Aloe vera genome sequence will help to reveal the genomic signatures of Asphodelaceae family and will also be useful in understanding the genetic basis of the important phenotypes such as medicinal properties and drought resistance in Aloe vera.

Therefore in this study, we report the first draft genome sequence of Aloe vera using a hybrid sequencing and assembly approach by combining the Illumina short-read and oxford nanopore long-reads sequences to construct the genome sequence. The transcriptome sequencing and analysis of two tissues, root and leaf, was carried out to gain deep insights into the gene expression and to precisely determine its gene set. The genome-wide phylogeny of Aloe vera with other available monocot genomes was also constructed to resolve its phylogenetic position. The comparative analysis of Aloe vera with other monocot genomes revealed adaptive evolution in its genes and provided insights on the stress tolerance capabilities of this species.

METHODS AND MATERIALS

Sample collection and sequencing

The Aloe vera plant was bought from a plant nursery in Bhopal, India. The pulp or gel from the leaf was scrapped out and the rest was used for the DNA extraction followed by amplification of complete ITS1 and ITS2 (Internal Transcribed Spacer) and Maturase K (MatK) regions for species identification. The library was prepared using NEBNext Ultra II DNA Library preparation Kit for Illumina (New England Biolabs, England) and TruSeq DNA Nano Library preparation kits (Illumina, Inc., United States). The libraries were sequenced on Illumina HiSeq X ten and NovaSeq 6000 platforms (Illumina, Inc., United States) to generate 150 bp paired-end reads. The DNA extraction for long read sequencing was performed as per the Oxford nanopore protocols. The purified samples were used for library preparation by following the protocol of Genomic DNA by Ligation using SQK-LSK109 kit (Oxford Nanopore, UK). The library was loaded on FLO-MIN106 Flow cell (R 9.4.1) and sequenced on MinION (Oxford Nanopore, UK) using MinKNOW software (versions 3.4.5 and 3.6.0). The leaf and root part of plant were used for RNA extraction using TRIzol reagent (Invitrogen, USA). The library was prepared by using TruSeq Stranded mRNA LT Sample Prep kit and following TruSeq Stranded mRNA Sample Preparation Guide (Illumina, Inc., United States) and sequenced on Illumina NovaSeq 6000 platform for 101 bp paired end reads. Prior to sequencing the quality and quantity of libraries were assessed using Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA) and qPCR, respectively. The detailed methodology and protocols are mentioned in Supplementary Text S1.

Genome assembly

The raw Illumina sequence data was processed using the Trimmomatic V0.38 tool [17]. For nanopore data, the adapter sequences were removed by using Porechop v0.2.3. SGA-preqc was used to estimate the genome size of Aloe vera using a k-mer count distribution method [18]. The filtered paired and unpaired Illumina reads were de novo assembled using ABySS v2.1.5 [19]. Different assemblies were generated on a sample dataset at increasing k-mer values, which showed the best assembly at k-mer value of 107, and hence the final assembly on complete data was performed at this k-mer value. The preprocessed nanopore reads were de novo assembled using wtdbg2 v2.0.0 [20]. The obtained genome assembly was first corrected for the assembly and sequencing errors using short-read data by SeqBug [21]. The hybrid assembly from the short-read and long-read assembly was generated by considering only those contigs from the ABySS and wtdbg2 assemblies that showed less than 50% query coverage and 90% identity using BLASTN against each other. The RNA-seq data based scaffolding was performed using ‘Rascaf’, followed by the long-read based gap-closing performed using LR_Gapcloser to generate the final Aloe vera genome assembly [22]. The other details about the data preprocessing, genome size estimation, and genome assembly and polishing are mentioned in Supplementary Text S2 and Supplementary Figure S1.

Genome annotation

The genome annotation was performed on all the contigs of hybrid assembly. The tandem repeats were identified using Tandem Repeat Finder (TRF) v4.09 [23]. The microRNAs (miRNAs) were identified using a homology-based approach using miRBase database, and tRNAs were predicted using tRNAscan-SE v2.0.5 [24–27] (Supplementary Text S3).

Transcriptome assembly

The transcriptome assembly of Aloe vera was carried out using the RNA-seq data generated from the root and leaf tissue in this study and previous studies [8, 28]. All the quality-filtered paired and unpaired transcriptome sequencing reads were de novo assembled using Trinity v2.6.6 software with default parameters to generate the assembled transcripts [29]. The transcriptome assembly was evaluated by mapping the filtered RNA-seq data on the assembled transcripts using hisat2 v2.1.0 [30]. The BUSCO score was used to assess the completeness of the transcriptome assembly calculated by BUSCO v4.0.5 software using the standard database specific to the Liliopsida class [31, 32].

Gene set construction

The maker pipeline was used for gene set construction of the Aloe vera genome [33]. The soft-masked genome of Aloe vera (contigs ≥300 bp) generated using RepeatMasker v4.1.0 with Repbase repeat library (RepeatMasker Open-4.0, http://www.repeatmasker.org) was used for the gene prediction using the maker pipeline. Both the ab initio and empirical evidence were used for the gene predictions. The Aloe vera EST evidence from the RNA-seq assembly of Aloe vera species, protein sequences of the closest species Dioscorea rotundata and Musa acuminata, and ab initio gene predictions of the Aloe vera genome were used to construct the gene set using the maker pipeline. AUGUSTUS v3.2.3 was used for the ab initio gene prediction, and the BLAST alignment tool was used for homology-based gene prediction using the EST evidence in the maker pipeline [34–36]. Further, Exonerate v2.2.0 was used to polish and curate the BLAST alignment results (https://github.com/nathanweeks/exonerate). The evidence from ab initio and homology-based methods were integrated to perform the final gene predictions.

The genes from predicted transcripts were identified by extracting the longest isoforms. The unigenes were identified by performing the clustering using CD-HIT-EST v4.8.1 program, and coding regions were predicted using TransDecoder v5.5.0 (https://github.com/TransDecoder/TransDecoder) [37–41]. The gene set constructed using the maker pipeline and transcriptome assembly was filtered, and only the genes with ≥300 bp length were considered further. The clustering of remaining maker pipeline based genes was performed using CD-HIT-EST v4.8.1 program with 95% identity and a seed size of 8 bp [41]. The transcriptome gene set was searched in the maker gene set using BLASTN. The genes from the transcriptome assembly gene set that matched to the maker gene set with the parameters: identity ≥50%, e-value <10−9, and query coverage ≥50% were removed. The remaining genes for the transcriptome assembly gene set were directly added to the maker gene set to construct the final gene set of Aloe vera.

Orthogroups identification

For orthogroups identification, the representative of monocot species from all the clades, for which high-quality genomes were available on Ensembl plants database, were selected along with an outgroup species, the model plant Arabidopsis thaliana. The selected monocot species were Aegilops tauschii, Brachypodium distachyon, Dioscorea rotundata, Eragrostis tef, Hordeum vulgare, Leersia perrieri, Musa acuminata, Oryza sativa, Panicum hallii fil2, Saccharum spontaneum, Setaria italica, Sorghum bicolor, Triticum aestivum, and Zea mays. The proteome files containing all the protein sequences of the 15 species retrieved from Ensembl plants release 46 [42], and the protein-coding genes from the transcriptome assembly of Aloe vera were used to construct the orthogroups. The longest transcript for each gene was extracted for each species using in-house python scripts. The proteome files with longest transcripts were used for the orthogroups identification using OrthoFinder v2.3.9 [43]. The OrthoFinder v2.3.9 analysis included a total of 16 species, i.e., 14 monocot species, the model species Arabidopsis thaliana as an outgroup, and Aloe vera sequenced in this study.

Orthologous gene set construction

From the orthogroups identified by the OrthoFinder analysis, the orthogroups with the taxon count of 16 were extracted, which included genes from each of the 16 species. A total of 5,472 orthogroups were extracted using this criterion. Only the longest gene of each species was retained in each of these orthogroups to construct the orthologous gene set. Thus, a total of 5,472 orthologs were identified across 16 species. From these 5,472 orthologs one-to-one orthologs were extracted. To include maximum number of genes in the one-to-one orthology, the fuzzy one-to-one orthogroups instead of true one-to-one orthogroups were identified using KinFin v1.0 [44]. A total of 1,440 one-to-one orthologs were extracted using this method across the selected 16 species.

Phylogenetic tree construction

The phylogenetic species tree was constructed with the fuzzy one-to-one orthologous genes. The individual orthologous sets were aligned using MAFFT v7.455 [45]. The alignments were trimmed using BeforePhylo v0.9.0 (https://github.com/qiyunzhu/BeforePhylo) to remove the poorly aligned regions. All protein sequence alignments of orthologs across 16 species were concatenated using BeforePhylo v0.9.0, followed by species phylogenetic tree construction using RAxML v8.2.12 [46]. The maximum likelihood phylogenetic tree was constructed using the rapid hill climbing algorithm with the 100 bootstrap replicates. Since the amino acid sequences were used, the ‘PROTGAMMAGTR’ substitution model was utilized to construct the species tree.

Identification of genes with a higher rate of evolution

The genes that show higher root-to-tip branch length are considered to have a higher rate of nucleotide divergence or mutation, indicating a higher rate of evolution. For this analysis, the individual maximum likelihood phylogenetic trees were constructed using the protein sequences of the 5,472 orthologs identified across the 16 species. The maximum likelihood phylogenetic trees with 100 bootstrap replicates were constructed using the rapid hill climbing algorithm with the ‘PROTGAMMAGTR’ substitution model by using RAxML v8.2.12 [46]. The root-to-tip branch length values were calculated for each of the 16 extant species using the ‘adephylo’ package in R [47, 48]. All the genes that showed a significantly higher root-to-tip branch length for Aloe vera in comparison to rest of the species were extracted using in-house Perl scripts and were considered to be the genes with a higher rate of evolution in Aloe vera.

Identification of positively selected genes

The positively selected genes in Aloe vera were identified using the branch-site model implemented in the PAML software package v4.9a [49]. An iterative program for sequence alignment, SAT’e, was utilized to perform the alignments of the 5,472 ortholog protein sequences. The combination of Prank, MUSCLE, and RaxML was used to perform the SAT’e based alignment to control the false positives and false negatives in the alignment [50]. The protein-alignment guided codon alignment was performed for the 5,472 ortholog nucleotide sequences using ‘TRANALIGN’ program of EMBOSS v6.5.7 package [51]. The ‘codeml’ was run on ortholog codon alignments using the species phylogenetic tree constructed in previous steps. The alignments were filtered for the ambiguous codon sites and gaps and only the clean sites were considered for the positive selection analysis. The likelihood ratio tests were performed using the log-likelihood values for the null and alternative models, and the p-values were calculated based on the χ2-distribution. Further, the FDR corrected p-values or FDR q-values were also calculated. All genes with FDR-corrected p-values <0.05 were considered to be the genes with positive selection in Aloe vera. Further, all codon sites with >0.95 probability of being positively selected in the ‘foreground’ branch based on the Bayes Empirical Bayes analysis were considered to be the positively selected codon sites in a gene.

Identification of genes with unique substitutions that have functional impact

The genes with unique amino acid substitutions in Aloe vera species in comparison to all the selected species were identified. The protein alignments for the 5,472 orthologs were generated using the MAFFT v7.455 [45]. The positions that are identical in all the species but different in Aloe vera were identified and considered to be the sites with unique amino acid substitutions in Aloe vera. In this analysis, the gaps were ignored, and also the sites with gaps present in the 10 amino acids flanking regions on both sides were ignored. This step helped in considering only the sites with proper alignment for the unique substitution analysis. The identification of unique amino acid sites was performed by using the in-house python scripts. The functional impact of the unique amino acid substitutions on the protein function was identified using the Sorting Intolerant From Tolerant (SIFT) tool with UniProt database as reference [52, 53].

Identification of genes with multiple signs of adaptive evolution (MSA)

The genes that showed at least two signs of adaptive evolution among the three signs of adaptive evolution tested above (higher rate of evolution, positive selection, and unique substitution with functional impact) were considered as the genes with multiple signs of adaptive evolution or MSA genes in Aloe vera.

Functional annotation

The functional annotation of gene sets was performed using multiple methods. The functional annotation and functional categorization of genes into different eggNOG categories was performed using the eggNOG-mapper [54]. The genes were assigned to different KEGG pathways, and also the KEGG orthology was determined using the most updated KAAS genome annotation server [55]. The gene ontology enrichment or GO term enrichment analysis was performed using the WebGestalt web server [56]. In the over representation analysis, only the GO categories with the p-value <0.05 in the hypergeometric test were considered to be functionally enriched in the gene set. Further, the functional annotation of genes was also manually curated. The assignment of genes to the specific categories and phenotypes was performed by manual annotation. The protein-protein interaction and co-expression data were extracted from the STRING database, and the network analysis was performed using Cytoscape [57, 58].

RESULTS

Sequencing of Aloe vera genome and transcriptome

The estimated genome size of Aloe vera is 16.04 Gbp, and to comprehensively cover this large genome, a total of 506.4 Gbp (~32X) of short-reads and 123.5 Gbp (~7.7X) of long-reads data was generated using Illumina and nanopore platforms, respectively (Supplementary Table S1 and S2) [16, 59]. For transcriptome, a total of 6.6 Gbp and 7.3 Gbp of RNA-seq data was generated from leaf and root, respectively. The transcriptome data from this study and the publicly available RNA-seq data from previous studies [8, 28] were combined together, resulting in a total of 37.1 Gbp of RNA-seq data for Aloe vera, which was used for the analysis (Supplementary Table S3). All the genomic and RNA-seq read data were trimmed and filtered using Trimmomatic, and only the high-quality read data was used to construct the final genome and transcriptome assemblies. The complete workflow of the sequence analysis is shown in Supplementary Figure S1.

Assembly of Aloe vera genome

The final draft genome assembly of Aloe vera had the size of 13.83 Gbp with N50 and largest scaffold of 3.18 kbp and 4.94 Mbp, respectively (Supplementary Table S4). Of which, 12.25 Gbp had length >300bp with N50 of 7.03 kbp, and 9.85 Gbp had length >500bp with N50 of 13.06 kbp, which is a challenging feat for such a gigantic plant genome, and is also comparable to the other large plant genomes assembled till date [60–63]. This was achieved by the hybrid assembly of short-read and long-read data, which was further polished by correction using SeqBug, RNA-seq based scaffolding using Rascaf, and long-read based gapclosing using LR-gapcloser. The k-mer count distribution-based method using only the short Illumina reads estimated a genome size of 13.63 Gb, which was smaller than the c-value-based genome size estimation of 16.04 Gbp, conceivably due to the usage of only short reads data for the genome size estimation (Supplementary Figure S2). The %GC for the final assembly was 41.98%. The analysis of repetitive sequences revealed 557,638,058 bp of tandem repeats corresponding to 3.41% of the complete genome.

Transcriptome assembly

The Trinity assembly of transcriptomic reads resulted in a total size of 163,190,792 bp with an N50 value of 1,268 bp and an average contig length of 796 bp (Supplementary Table S5). The mapping of filtered RNA-seq reads on the Trinity transcripts using hisat2 resulted in the overall percentage mapping of 92.49%. The complete BUSCO score (addition of single copy and duplicates) on the transcripts was 87.7%. A total of 205,029 transcripts were predicted, corresponding to 108,133 genes with the percent GC of 43.69. The clustering of gene sequences using CD-HIT-EST to remove the redundancy resulted in 107,672 unigenes. The coding genes (CDS) from the unigenes were predicted using TransDecoder resulting in 34,269 coding genes.

Genome annotation and gene set construction

A total of 1,978 standard amino acid specific tRNAs and 378 hairpin miRNAs were identified in the Aloe vera genome (Supplementary Table S6). The maker pipeline-based gene prediction resulted in a total of 114,971 coding transcripts, of which 63,408 transcripts (≥300 bp) were considered further for clustering at 95% identity resulting in 57,449 unique coding gene transcripts. Application of the same length-based selection criteria (≥300 bp) on trinity-identified 34,269 coding gene transcripts resulted in 33,998 coding gene transcripts. The merging to these two coding gene transcript sets resulted in the final gene set of 86,177 genes for Aloe vera, which had the complete BUSCO score of 69.0% and single copy BUSCO score of 65.7%.

Identification of orthologous across selected plant species

A total of 104,543 orthogroups were identified using OrthoFinder across the selected 16 plant species, of which 9,343 orthogroups were unique to Aloe vera and contained genes only from Aloe vera. Only a total of 5,472 orthogroups had sequences from all the 16 plant species and were used for the identification of orthologs. For these 5,472 orthogroups, in case of presence of more than one gene from a species in an orthogroup, the longest gene representative from that species was selected to construct the final orthologous gene set for any orthogroups. Thus, including one gene from each of the 16 species in an orthogroup, a total of 5472 orthologs were identified. In addition, the fuzzy one-to-one orthologs finding approach applied using KinFin resulted in a total of 1,440 fuzzy one-to-one orthologs that were used for constructing the maximum likelihood species phylogenetic tree.

Resolving the phylogenetic position of Aloe vera

Each of the 1,440 fuzzy one-to-one orthologous gene set was aligned and concatenated, and the resulted concatenated alignment had a total of 1,453,617 alignment positions. The concatenated alignment was filtered for the undetermined values, which were treated as missing values, and a total of 1,157,550 alignment positions were retained. The complete alignment data and the filtered alignment data were both used to construct maximum likelihood species trees using RAxML with the bootstrap value of 100, and both the alignment data resulted in the same phylogeny. Thus, the phylogeny based on the filtered data was considered to be the final genome-wide phylogeny of Aloe vera with all the representative monocot genomes available on Ensembl plants database and Arabidopsis thaliana as an outgroup (Figure 1). This phylogeny also corroborated with the earlier reported phylogenies by Silvera et al., 2014, Dunemann et al., 2014, and Wang et al., 2016, which were constructed using a limited number of genetic loci [64–66]. It is apparent from the phylogeny that Dioscorea rotundata and Musa acuminata are the most closely related to Aloe vera, and share the same clade (Figure 1). All other selected monocots are distributed in separate clade with Triticum aestivum and Aegilops tauschii being the most distantly related to Aloe vera.

Figure 1.
  • Download figure
  • Open in new tab
Figure 1.

The phylogenetic tree of the selected 14 monocot species, Aloe vera, and Arabidopsis thaliana as an outgroup

The values mentioned at the nodes are the bootstrap values. The scale mentioned is the nucleotide substitutions per base.

Recently an updated plant megaphylogeny has been reported for the vascular plants [14]. The species of Poales order showed similar relative positions in our reported phylogeny and this megaphylogeny. In the megaphylogeny, Musa acuminata was reported to share the most common recent ancestor with the species of Poales order, but in our phylogeny we observed that Musa acuminata shared the most common recent ancestor with Dioscorea rotundata from Dioscoreales order (Figure 1 and Supplementary Figure S3). Also, among the selected monocots, the species of Dioscoreales order was reported to show the earliest divergence. However, in our genome-wide phylogeny, Aloe vera showed the earliest divergence.

Also, with respect to the reported phylogeny of angiosperms, at the order level the Poales and Zingiberales formed a clade, and their ancestor shared the most recent common ancestor with Asparagales, then all three shared a recent ancestor with Dioscoreales [15]. In our genome-wide phylogeny, Zingiberales and Dioscoreales shared the most recent common ancestor, and their ancestor shares the most recent common ancestor with Asparagales, and the three shared a recent ancestor with Poales.

Genes with a higher rate of evolution

A total of 85 genes showed higher rates of evolution in Aloe vera in comparison to the other monocot species. These genes belonged to several eggNOG categories and KEGG pathways, as mentioned in Supplementary Table S7 and Supplementary Table S8, with a higher representation of ribosomal genes. The distribution of enriched (p-value<0.05) biological process GO terms is mentioned in Supplementary Table S9. Also, among these 85 genes, three molecular function GO terms, rRNA binding, structural constituent of cytoskeleton, and structural constituent of ribosome showed an enrichment (p-value<0.05) (Supplementary Table S10). Five transcription factors WRKY, MYB, bHLH, CPP, and LBD showed higher rates of evolution in Aloe vera. Among these, WRKY, MYB, and bHLH are known to be involved in drought stress tolerance [67–69]. There were six chloroplast functioning related genes, namely EMB3127, PnsB3, TL29, IRT3, PDV2, and SIRB, that showed a higher rate of evolution. Notably, the chloroplast function related genes have been implicated in different abiotic stress conditions in plants, including drought [70, 71].

Identification of positively selected genes

A total of 199 genes showed positive selection in Aloe vera with the FDR q-value threshold of 0.05. The distribution of these genes in eggNOG categories, KEGG pathways, and GO term categories are mentioned in Supplementary Table S11-S15. Among the genes with positive selection, several genes were involved in key functions with specific phenotypic consequences (Figure 2). These included flowering related genes that are important for the reproductive success, calcium-ion binding and transcription factors/sequence-specific DNA binding genes involved in signal transduction for response to external stimulus, carbohydrate catabolism genes required for energy production, and genes involved in abiotic stress response [72–74]. Among the abiotic stress response genes, there were four categories of genes that were involved in response to water-related stress, DNA damage response genes involved in reactive oxidative species (ROS) stress response, nuclear pore complex genes involved in plant stress response by regulating the nucleo-cytoplasmic trafficking, and secondary metabolites biosynthesis related genes that deal with different types of biotic and abiotic stresses [75–77]. The robust and efficient DNA damage response mechanism is essential for biotic and abiotic stress tolerance, and for the genomic stability [78]. Thus, adaptive evolution in this pathway seemingly contributes towards the stress tolerance capabilities and genomic stability in Aloe vera.

Figure 2.
  • Download figure
  • Open in new tab
Figure 2.

The functional categories of genes that showed positive selection in Aloe vera

The standard Arabidopsis thaliana gene IDs were used in case of genes that did not have a standard gene symbol.

Another gene G6PD5 that showed positive selection in Aloe vera protects plants against different types of stress, such as salinity stress by producing nitric oxide (NO) molecule, which leads to the expression of Defence response genes [79, 80]. Regulation of osmotic potential under drought stress is acquired by different ion channels, transporters, and carrier proteins [81]. In this study, K+ transporter 1(KT1), bidirectional amino acid transporter 1(BAT1), and Sodium Bile acid symporter (AT4G22840) genes were found to be positively selected in Aloe vera.

The Abscisic acid (ABA) responsive element binding factor (ABF) gene was found to be positively selected. This gene is differentially expressed under drought and other abiotic stress and alters specific target gene expression by binding to ABRE (abscisic acid-response element), the characteristic element of ABA-inducible genes [82]. ABA also regulates stomatal closure and solute transport, and thus have implications in drought tolerance [83]. The trehalase 1 (TRE1) gene was also found to be positively selected, and the over-expression of this gene causes better drought tolerance through ABA guided stomatal closure [84].

Genes with site-specific signs of evolution

Two types of site-specific signatures of adaptive evolution i.e., positively selected codon sites and unique amino acid substitutions with significant functional impact were identified in Aloe vera. A total of 1,848 genes had positively selected codon sites, and a total of 2,669 genes had unique amino acid substitutions with functional impact. The distribution of genes with positively selected codon sites and unique amino acid substitutions with functional impact in eggNOG categories, KEGG pathways, and GO term categories are mentioned in Supplementary Tables S16-S25.

One of the characteristics of succulent plants such as Aloe vera is the ability to efficiently assimilate the atmospheric CO2 and reduce water loss by transpiration through the crassulacean acid metabolism (CAM) pathway a specific mode of photosynthesis. The evolution of CAM is an adaptation to the limited CO2 and limited water condition, and a significant correlation between higher succulence and increased magnitude of CAM metabolism has been observed [85]. In this study, several crucial genes of CAM metabolism showed site-specific signatures of adaptive evolution in Aloe vera (Figure 3). The potassium channel involved in stomatal opening/closure (KAT2), malic enzyme (ME) that converts malic acid to pyruvate, and phosphoenolpyruvate carboxylase (PEPC) that converts phosphoenolpyruvate to oxaloacetate and assimilates the environmental CO2 showed both the signs of site-specific adaptive evolution. In addition, the other CAM genes including potassium transport 2/3 (KT2/3), pyruvate orthophosphate dikinase (PPDK), phosphoenolpyruvate carboxylase kinase 1 (PPCK1), carbonic anhydrase 1 (CA1), peroxisomal NAD-malate dehydrogenase 2 (PMDH2), tonoplast dicarboxylate transporter (TDT), and aluminum activated malate transporter family protein (ALMT9) showed unique substitutions with functional impact in Aloe vera.

Figure 3.
  • Download figure
  • Open in new tab
Figure 3.

The adaptive evolution of CAM pathway in Aloe vera

The important genes of the CAM pathway are shown with their function in the day time and night time metabolism. The genes in Levander color had positively selected codon sites, the genes in Green color had unique substitutions with function impact, and the genes in Red color showed both the signs of site-specific adaptive evolution in Aloe vera. There were no CAM pathway genes that had only positively selected codon sites.

CAM metabolism evolution is known to be a result of modified circadian regulation at the transcription and posttranscriptional levels [86]. CAM evolution is the well-characterized physiological rhythm in plants, and it is also a specific example of circadian clock-based specialization [86, 87]. Several plant circadian rhythm genes showed site-specific signs of adaptive evolution in Aloe vera (Figure 4). Three essential genes of red light response, PHYB, ELF3, and LHY, showed both the signs of site-specific adaptive evolution. Also, the FT gene important for flowering and under the control of circadian rhythm showed both the signs of site-specific adaptive evolution. The PHYA gene, which is also a part of the red light response, had unique substitutions with functional impact. Among the blue light response genes, three genes GI, FKF1, and SPA2 had unique substitutions with functional impact, and two genes HY5 and CHS had positively selected codon acid sites. The blue light response regulates the UV-protection and photomorphogenesis.

Figure 4.
  • Download figure
  • Open in new tab
Figure 4.

The adaptive evolution of circadian rhythm pathway in Aloe vera

The important genes of the plant circadian rhythm are shown with their function. The genes in Levander color had positively selected codon sites, the genes in Green color had unique substitutions with function impact, and the genes in Red color showed both the signs of site-specific adaptive evolution in Aloe vera.

Plant hormone signaling regulates plant growth, development, and response to different types of biotic and abiotic stress [88]. Multiple genes of auxin, cytokinin, and brassinosteroid hormone signaling involved in cellular growth and elongation having implications in cellular and tissue succulence, showed site-specific signatures of adaptive evolution (Figure 5). The genes of the abscisic acid hormone signaling involved in stomatal opening/closure required for CAM metabolism and different biotic and abiotic stress response [82] had positively selected amino acid sites and unique substitution sites with functional impact (Figure 5). Also, the genes involved in salicylic acid signaling important for providing disease resistance and help in biotic stress response showed site-specific signatures of adaptive evolution (Figure 5).

Figure 5.
  • Download figure
  • Open in new tab
Figure 5.

The adaptive evolution of plant hormone signaling pathway in Aloe vera

The important genes of the auxin, cytokinin, abscisic acid, ethylene, brassinosteroid, and salicylic acid signaling pathways are shown with their function. The genes in Levander color had positively selected codon sites, the genes in Green color had unique substitutions with function impact, and the genes in Red color showed both the signs of site-specific adaptive evolution in Aloe vera.

Genes with multiple signs of adaptive evolution

Among the three signatures of adaptive evolution i.e., positive selection, a higher rate of evolution, and unique amino acid substitutions with functional impact, a total of 148 genes showed two or more signs of adaptive evolution and were identified as the genes with multiple signs of adaptive evolution (MSA). The distribution of these genes in eggNOG categories, KEGG pathways, and GO categories are mentioned in Supplementary Table S26-S29. Another study that performed the proteomic analysis of drought stress response in wild peach also found similar categories to be enriched in the proteins that were differentially expressed under drought conditions [89]. A total of 112 genes out of the 148 MSA genes in Aloe vera were from the specific categories that are involved in providing drought stress tolerance. The specific groups of proteins and their relation with the drought stress tolerance are mentioned in Figure 6.

Figure 6.
  • Download figure
  • Open in new tab
Figure 6.

The MSA genes in Aloe vera that are involved in drought stress response

The relation of specific categories of genes with drought stress response was determined from the literature. The standard Arabidopsis thaliana gene IDs were used in case of genes that did not have a standard gene symbol.

Several ribosomal genes, translational regulators, and transcription factors genes were found to be MSA genes in this study, and these were also found to be over-expressed under drought conditions in different proteomic and transcriptomic studies and aid in better drought stress survival [89, 90]. Many nuclear genes are involved in the functioning of symbiotic organelles chloroplast and mitochondria. Among these genes, some genes are also involved in the organellar gene expression (OGEs) regulation, and mutants of these genes are known to show altered response to different abiotic stress, including high salinity stress [91, 92]. Several of these genes belonging to two categories, RNA helicases and PPR domain proteins, were found to be MSA genes. Thus, in the Aloe vera species, these genes have been adaptively evolved to provide this species with better salt tolerance.

Two osmotic biosensor genes, ‘CPA’ and ‘AT2G42100’, were found to be among the MSA genes in Aloe vera. Different membrane transporters that can transport signaling molecules, osmolytes, and metals were also among the MSA genes (Figure 6). These included two peroxisomal transporters ‘PNC1’ a nucleotide carrier protein, and ‘PEX14’ a transporter for PTS1 and PTS2 domain containing signaling proteins, different heavy metal transporters such as ‘IRT3’ an iron transporter, ‘AT5G23760’ a copper transporter, and ‘NRAMP1’ a manganese transporter, and ‘AT4G17650’ a lipid transporter, ‘AT2G40420’ an amino acid transporter, ‘AT5G06120’ an intracellular protein transporter, ‘ALA1’ a phospholipid transporter, ‘NAT8’ a Nucleobase-ascorbate transporter, ‘NRT2.6’ a high-affinity nitrate transporter, and ‘BASS6’ a sodium/metabolite co-transporter. These osmotic sensors and transporters provide significant enhancement in function in drought stress condition and help in adjusting to the water scarcity [93, 94].

The genes for several kinases and WD-40 repeat proteins were also found to be among the MSA genes in Aloe vera. These proteins are involved in signaling and transcription regulation required for the drought stress tolerance [90, 95–97]. Also, the genes involved in energy generation and are part of the thylakoid membrane showed MSA. The stability of thylakoid membrane proteins has been associated with drought resistance, and these energy production related genes are crucial in survival during the drought stress [90, 98]. Two genes that assist in protein folding were found to show MSA (Figure 6), and these proteins are very important in protecting the macromolecules of the cells under the drought stress conditions [99]. Five genes involved in plant hormone signaling were also among the MSA genes. The plant hormone signaling is central to the signaling pathways required for the drought stress tolerance [100]. Five genes involved in flowering and reproduction regulation were also found to be among the MSA genes in Aloe vera. The flowering and reproduction related genes are known to be regulated for better reproductive success under drought stress conditions as part of the drought tolerance strategy used by many plants [101, 102].

The co-expression of MSA genes was examined using the co-expression data from the STRING database [57], and the MSA genes that co-express with at least one other MSA gene are displayed as a network diagram (Figure 7A). From the network, it is evident that almost all co-expressing MSA genes are drought stress tolerance related, and the genes forming the dense network are also drought stress tolerance related. Predominantly, three categories of drought stress tolerance related MSA genes have shown co-expression: genes involved in energy production, genes involved in OGEs regulation, and genes that predispose plants to drought stress tolerance.

Figure 7.
  • Download figure
  • Open in new tab
Figure 7.

Evaluating the co-expression and physical interaction of MSA genes in Aloe vera

[A] The co-expression network of the MSA genes is shown. Only the MSA genes that showed at least one co-expression connection are shown. The nodes represent the genes, and the edges represent the co-expression of the connected nodes.

[B] The protein-protein interaction network of the MSA genes is shown. Only the MSA genes that showed at least one protein-protein interaction are shown. The nodes represent the genes, and the edges represent the protein-protein interaction between the connected nodes.

Note: The standard Arabidopsis thaliana gene IDs were used in case of genes that did not have a standard gene symbol.

Similarly, a network diagram was constructed using the protein-protein interaction data of MSA genes from the STRING database [57]. The genes with physical interaction known from the experimental studies are shown in Figure 7B. From the network, it is apparent that among the interacting MSA genes, all of them except one are involved in drought stress tolerance. Further, among the MSA genes involved in drought stress tolerance, it is primarily the genes that predispose plants to drought stress tolerance, and the genes involved in signal transduction in drought stress response showed the physical interaction. In addition, two genes that function as osmotic biosensors and two OGEs regulation genes also displayed physical interaction.

DISCUSSION

In this work, we have presented the complete draft genome sequence of Aloe vera, which is an evolutionarily important, ornamental, and widely used plant species due to its medicinal properties, pharmacological applications, traditional usage, and commercial value. The availability of Aloe vera genome sequence is also important since it is the first genome sequenced from the Asphodelaceae plant family, and is the largest angiosperm and the fifth largest genome sequenced so far. It is also the largest genome sequenced using the oxford nanopore technology till date. The hybrid approach of using short-read (Illumina) and long-read (nanopore) sequence data emerged as a successful strategy to tackle the challenge of sequencing one of the largest plant genomes.

The study reported the gene set of Aloe vera constructed using the combination of de novo and homology-based gene predictions, and also using the data from the genomic assembly and the transcriptomic assembly from multiple tissues, thus indicating the comprehensiveness of the approach. The Aloe vera had a higher number of coding genes than the other monocots used in this study except for Triticum aestivum, which had more number of coding genes (Supplementary Table S30). The estimation of coding genes in Aloe vera was similar to the number of genes in other monocot genomes suggesting the correctness of the gene prediction and estimation.

This study reported the first genome-wide phylogeny of Aloe vera with all other monocot species available on the Ensembl plant database, and with Arabidopsis thaliana as an outgroup. A few previous studies have also examined the phylogenetic position of Aloe vera with respect to other monocots but used a few genomic loci. Thus, this is the first genome-wide phylogeny of monocots that resolves the phylogenetic position of Aloe vera with respect to the other monocots by using 1,440 different loci distributed throughout their genomes. The very high bootstrap values for the internal nodes and existence of no polytomy in the phylogeny further attest to the correctness of the phylogeny. This phylogeny is mostly in agreement with the previously known phylogenies, and also provided some new insights [2, 14, 64–66].

An earlier phylogeny constructed using “ppc-aL1a” gene showed that Sorghum bicolor, Zea mays, Setaria italica, Brachypodium distachyon, Hordeum vulgare, and Oryza sativa form a monophyletic group, which was also observed in our phylogeny [66]. Similarly, the relative positions of Hordeum vulgare, Saccharum officinarum, Zea mays, and Oryza sativa in another phylogeny based on “CENH3” gene were in agreement with our phylogeny [64]. Using the “NORK” gene, another recent study reported the relative phylogenetic position of four monocot species: Oryza sativa, Zea mays, Sorghum bicolour, and Setaria italica [65]. Zea mays, Sorghum bicolour, and Setaria italica were found to share a recent last common ancestor and Oryza sativa had diverged earlier from their common ancestor, which is also supported by the genome-wide phylogeny reported in this study.

Though the genome-wide phylogeny showed the species of Poales order with similar topology as reported in earlier studies, a different topology was observed for the relative position of Musa acuminata, Dioscorea rotundata, and Aloe vera from the orders Zingiberales, Dioscoreales, and Asparagales, respectively [14, 15]. The observed differences could be due to the usage of a few genomic loci in the previous phylogenies, whereas the phylogeny reported in this study is a genome-wide phylogeny constructed using 1,440 one-to-one orthologs distributed across the genome. The availability of more complete genomes from monocots and the inclusion of more genomic loci in the phylogenetic analysis will help explain the observed differences and confirm the relative positions of these species.

One of the key highlights of the study was the revelation of adaptive evolution of genes involved in drought stress response, which provides a genetic explanation for the drought stress tolerance properties of Aloe vera. This plant is known to display a number of phenotypes such as perennial succulent leaves and CAM mechanism for carbon fixation that provide it with better drought stress survival [10]. Several experimental studies have also reported that it can make adjustments such as increased production of sugars and increased expression of heat-shock and ubiquitin proteins for efficient water utilization and osmotic maintenance that eventually provide better drought survival [11, 103, 104]. In this study, the majority (80%) of genes that showed multiple signs of evolution (MSA) were involved in drought stress tolerance related functions. These genes were also found to be co-expressing and physically interacting with each other, which further point towards the adaptive evolution of the drought stress tolerance mechanisms in this species. The adaptive evolution of genes involved in drought stress tolerance provides insights into the genetic basis of the drought resistance property of Aloe vera.

Further, several crucial genes of the CAM pathway and circadian rhythm have also shown site-specific signs of adaptive evolution in Aloe vera in comparison to the other monocot species. The CAM pathway has very high water use efficiency, and is known to have evolved convergently in many arid regions for better drought survival [105]. Also, the CAM pathway is a physiological rhythm with temporal separation of atmospheric CO2 assimilation and Calvin-Benson cycle, and is under the control of plant circadian rhythm [86, 106]. This CAM pathway evolution is known to be a specific type of circadian rhythm specialization [87, 107]. Thus, the observed adaptive evolution of CAM pathway and its controller circadian rhythm in this study point towards its role in providing this species an evolutionary advantage for efficient drought stress survival.

The evolutionary success of the Aloe genus is also known to be due to the succulent leaf Mesophyll tissue [2]. Particularly, the medicinal use of Aloe vera is much associated with the succulent leaf mesophyll tissue, and a loss of this tissue leads to the loss of medicinal properties [3]. The plant species with CAM pathway have large vacuoles in comparison to the non-CAM plants, and therefore the leaf succulence is also higher in CAM plants. Thus, it is tempting to speculate that the observed evolution of CAM pathway in Aloe vera may also be crucial for the higher leaf mesophyll succulence contributing to its medicinal properties. Also previously, it has been proposed that the specific properties of Aloe vera such as the high leaf succulence, medicinal properties, and drought resistance are the consequence of evolutionary processes such as selection and speciation rather than due to phylogenetic diversity or isolation [2]. The signatures of adaptive evolution in drought tolerance and CAM pathway genes in Aloe vera further substantiate this notion.

CONCLUSION

The first draft genome, transcriptome, gene set, and functional analysis of Aloe vera reported in this study will act as a reference for future studies to understand the medicinal or evolutionary characteristics of this species, and its family Asphodelaceae. The first genome-wide phylogeny of Aloe vera and other available monocot genomes resolved the phylogenetic position of Aloe vera and emphasized the need for the availability of more genomes for precise phylogenetic analysis. The comparative genomic analyses of Aloe vera with the other monocot genomes provided novel insights on the adaptive evolution of drought stress response, CAM pathway, and circadian rhythm genes in Aloe vera, and suggest that the positive selection and adaptive evolution of specific genes contribute to the unique phenotypes of this species.

COMPETING INTERESTS

The authors declare no competing financial and non-financial interest.

AUTHORS’ CONTRIBUTIONS

VKS conceived and coordinated the project. SM prepared the DNA and RNA samples, performed sequencing, and the species identification assay. SKJ with the input from VKS designed the computational framework of the study. SKJ and AC performed the genome assembly, transcriptome assembly, genome annotation, gene set construction, orthology analysis, and species phylogenetic tree construction. SKJ performed the root-to-tip branch length, positive selection, unique substitution with functional impact, network, and statistical analysis. SKJ, AC, SK, and SM performed the functional annotation of gene sets. SKJ, AC, and VKS analysed the data. SKJ, AC, and VKS interpreted the results. SKJ and AC constructed the figures. SKJ, AC, SM, SK, and VKS wrote and revised the manuscript. All the authors have read and approved the final version of the manuscript.

ACKNOWLEDGEMENTS

The author SKJ thanks Department of Science and Technology for the DST-INSPIRE fellowship. AC and SM thank Council of Scientific and Industrial Research (CSIR) for fellowship. SK thanks University Grants Commission (UGC) for the fellowship. We also thank the intramural research funds provided by IISER Bhopal.

Footnotes

  • Email addresses of authors: Shubham K. Jaiswal – shubhamj{at}iiserb.ac.in, Abhisek Chakraborty – abhisek18{at}iiserb.ac.in, Shruti Mahajan – shruti17{at}iiserb.ac.in, Sudhir Kumar – sudhir19{at}iiserb.ac.in, Vineet K. Sharma – vineetks{at}iiserb.ac.in

LIST OF ABBREVIATIONS

MSA
Multiple signs of adaptive evolution
CAM
Crassulacean acid metabolism
COG
Clusters of Orthologous Groups
KEGG
Kyoto Encyclopedia of Genes and Genomes
GO
Gene ontology
BUSCO
Benchmarking Universal Single-Copy Orthologs
SIFT
Sorting Intolerant From Tolerant
FDR
False discovery rate
BLAST
Basic Local Alignment Search Tool
N50
minimum contig length needed to cover 50% of the genome
ABA
Abscisic acid
snoRNA
small nucleolar RNA
snRNA
small nuclear RNA
tRNA
transfer RNA
rRNA
ribosomal RNA
srpRNA
signal recognition particle RNA
miRNA
micro RNA
MYB
Myeloblastosis
bHLH
basic helix–loop– helix
CPP
cysteine-rich polycomb-like protein
LBD
Lateral Organ Boundaries (LOB) Domain
EMB3127
Embryo Defective 3127
PnsB3
Photosynthetic NDH subcomplex B3
TL29
Thylakoid Lumen 29
IRT3
Iron regulated transporter 3
PDV2
Plastid Division2
SIRB
Sirohydrochlorin ferrochelatase B
G6PD5
Glucose-6-phosphate dehydrogenase 5
KAT2
Potassium channel in Arabidopsis thaliana 2
PHYB
Phytochrome B
ELF3
Early Flowering 3
LHY
Late Elongated Hypocotyl
FT
Flowering locus T
PHYA
Phytochrome A
GI
Gigantea
FKF1
Flavin-binding, Kelch repeat, F box 1
SPA1
Suppressor of PHYA-105 1
HY5
Elongated Hypocotyl5
CHS
Chalcone synthase
CPA
Capping Protein A
PNC1
Peroxisomal adenine nucleotide carrier 1
PEX14
Peroxin 14
IRT3
Iron regulated transporter 3
NRAMP1
Natural Resistance-Associated Macrophage Protein 1
ALA1
Aminophospholipid ATPase 1
NAT8
Nucleobase-Ascorbate Transporter 8
NRT2.6
High affinity Nitrate Transporter 2.6
ppc-aL1a
Phosphoenolpyruvate carboxylase
CENH3
Centromeric histone H3
NORK
Nodulation receptor kinase
PPR
Pentatricopeptide Repeat
PTS1
Peroxisomal targeting signal 1
PTS2
Peroxisomal targeting signal 2
LTR-RT
Long terminal repeat Retrotransposons
EST
Expressed sequence tag

REFERENCES

  1. 1.↵
    Silva H, Sagardia S, Seguel O, Torres C, Tapia C, Franck N, Cardemil L: Effect of water availability on growth and water use efficiency for biomass and gel production in Aloe Vera (Aloe barbadensis M.). Industrial Crops and Products 2010, 31(1):20–27.
    OpenUrl
  2. 2.↵
    Grace OM, Buerki S, Symonds MR, Forest F, van Wyk AE, Smith GF, Klopper RR, Bjorå CS, Neale S, Demissew S: Evolutionary history and leaf succulence as explanations for medicinal use in aloes and the global popularity of Aloe vera. BMC evolutionary biology 2015, 15(1):29.
    OpenUrl
  3. 3.↵
    Reynolds T, Dweck A: Aloe vera leaf gel: a review update. Journal of ethnopharmacology 1999, 68(1-3):3–37.
    OpenUrlCrossRefPubMedWeb of Science
  4. 4.↵
    Gupta VK, Malhotra S: Pharmacological attribute of Aloe vera: Revalidation through experimental and clinical studies. Ayu 2012, 33(2):193.
    OpenUrl
  5. 5.↵
    Raksha B, Pooja S, Babu S: Bioactive compounds and medicinal properties of Aloe vera L.: An update. Journal of Plant Sciences 2014, 2(3):102–107.
    OpenUrl
  6. 6.↵
    Hamman JH: Composition and applications of Aloe vera leaf gel. Molecules 2008, 13(8):1599–1616.
    OpenUrlCrossRefPubMedWeb of Science
  7. 7.↵
    Joseph B, Raj SJ: Pharmacognostic and phytochemical properties of Aloe vera linn an overview. International journal of pharmaceutical sciences review and research 2010, 4(2):106–110.
    OpenUrl
  8. 8.↵
    Choudhri P, Rani M, Sangwan RS, Kumar R, Kumar A, Chhokar V: De novo sequencing, assembly and characterisation of Aloe vera transcriptome and analysis of expression profiles of genes related to saponin and anthraquinone metabolism. BMC genomics 2018, 19(1):427.
    OpenUrl
  9. 9.↵
    Nobel PS, Jordan PW: Transpiration stream of desert species: resistances and capacitances for a C3, a C4, and a CAM plant. Journal of experimental botany 1983, 34(10):1379–1391.
    OpenUrlCrossRefWeb of Science
  10. 10.↵
    Jin ZM, Wang CH, Liu ZP, Gong WJ: Physiological and ecological characters studies on Aloe vera under soil salinity and seawater irrigation. Process Biochemistry 2007, 42(4):710–714.
    OpenUrl
  11. 11.↵
    Delatorre-Herrera J, Delfino I, Salinas C, Silva H, Cardemil L: Irrigation restriction effects on water use efficiency and osmotic adjustment in Aloe Vera plants (Aloe barbadensis Miller). Agricultural Water Management 2010, 97(10):1564–1570.
    OpenUrl
  12. 12.↵
    Adams SP, Leitch IJ, Bennett MD, Chase MW, Leitch AR: Ribosomal DNA evolution and phylogeny in Aloe (Asphodelaceae). American Journal of Botany 2000, 87(11):1578–1583.
    OpenUrlAbstract/FREE Full Text
  13. 13.↵
    Treutlein J, Smith GF, Van Wyk B-E, Wink M: Phylogenetic relationships in Asphodelaceae (subfamily Alooideae) inferred from chloroplast DNA sequences (rbcL, matK) and from genomic fingerprinting (ISSR). Taxon 2003, 52(2):193–207.
    OpenUrlCrossRefWeb of Science
  14. 14.↵
    Qian H, Jin Y: An updated megaphylogeny of plants, a tool for generating plant phylogenies and an analysis of phylogenetic community structure. Journal of Plant Ecology 2016, 9(2):233–239.
    OpenUrlCrossRef
  15. 15.↵
    Chase MW, Christenhusz M, Fay M, Byng J, Judd WS, Soltis D, Mabberley D, Sennikov A, Soltis PS, Stevens PF: An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV. Botanical Journal of the Linnean Society 2016, 181(1):1–20.
    OpenUrlCrossRef
  16. 16.↵
    Zonneveld BJ: Genome size analysis of selected species of Aloe (Aloaceae) reveals the most primitive species and results in some new combinations. Bradleya 2002, 2002(20):5–12.
    OpenUrl
  17. 17.↵
    Bolger AM, Lohse M, Usadel B: Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30(15):2114–2120.
    OpenUrlCrossRefPubMedWeb of Science
  18. 18.↵
    Simpson JT, Durbin R: Efficient de novo assembly of large genomes using compressed data structures. Genome research 2012, 22(3):549–556.
    OpenUrlAbstract/FREE Full Text
  19. 19.↵
    Birol I, Jackman SD, Nielsen CB, Qian JQ, Varhol R, Stazyk G, Morin RD, Zhao Y, Hirst M, Schein JE: De novo transcriptome assembly with ABySS. Bioinformatics 2009, 25(21):2872–2877.
    OpenUrlCrossRefPubMedWeb of Science
  20. 20.↵
    Ruan J, Li H: Fast and accurate long-read assembly with wtdbg2. Nature methods 2020, 17(2):155–158.
    OpenUrl
  21. 21.↵
    Mittal P, Jaiswal SK, Vijay N, Saxena R, Sharma VK: Comparative analysis of corrected tiger genome provides clues to its neuronal evolution. Scientific reports 2019, 9(1):1–11.
    OpenUrl
  22. 22.↵
    Song L, Shankar DS, Florea L: Rascaf: improving genome assembly with RNA sequencing data. The plant genome 2016, 9(3).
  23. 23.↵
    Benson G: Tandem repeats finder: a program to analyze DNA sequences. Nucleic acids research 1999, 27(2):573–580.
    OpenUrlCrossRefPubMedWeb of Science
  24. 24.↵
    Bolser D, Staines DM, Pritchard E, Kersey P: Ensembl plants: integrating tools for visualizing, mining, and analyzing plant genomics data. In: Plant bioinformatics. Springer; 2016: 115–140.
  25. 25.
    Howe KL, Contreras-Moreira B, De Silva N, Maslen G, Akanni W, Allen J, Alvarez-Jarreta J, Barba M, Bolser DM, Cambell L: Ensembl Genomes 2020—enabling non-vertebrate genomic research. Nucleic acids research 2020, 48(D1):D689–D695.
    OpenUrl
  26. 26.
    Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic acids research 2007, 36(suppl_1):D154–D158.
    OpenUrlCrossRefPubMedWeb of Science
  27. 27.↵
    Chan PP, Lowe TM: tRNAscan-SE: searching for tRNA genes in genomic sequences. In: Gene Prediction. Springer; 2019: 1–14.
  28. 28.↵
    Wickett NJ, Mirarab S, Nguyen N, Warnow T, Carpenter E, Matasci N, Ayyampalayam S, Barker MS, Burleigh JG, Gitzendanner MA: Phylotranscriptomic analysis of the origin and early diversification of land plants. Proceedings of the National Academy of Sciences 2014, 111(45):E4859–E4868.
    OpenUrlAbstract/FREE Full Text
  29. 29.↵
    Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, Couger MB, Eccles D, Li B, Lieber M: De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nature protocols 2013, 8(8):1494.
    OpenUrl
  30. 30.↵
    Kim D, Langmead B, Salzberg SL: HISAT: a fast spliced aligner with low memory requirements. Nature methods 2015, 12(4):357–360.
    OpenUrl
  31. 31.↵
    Waterhouse RM, Seppey M, Simão FA, Manni M, Ioannidis P, Klioutchnikov G, Kriventseva EV, Zdobnov EM: BUSCO applications from quality assessments to gene prediction and phylogenomics. Molecular biology and evolution 2018, 35(3):543–548.
    OpenUrlCrossRefPubMed
  32. 32.↵
    Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM: BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 2015, 31(19):3210–3212.
    OpenUrlCrossRefPubMed
  33. 33.↵
    Campbell MS, Holt C, Moore B, Yandell M: Genome annotation and curation using MAKER and MAKER-P. Current protocols in bioinformatics 2014, 48(1):4.11. 11–14.11. 39.
    OpenUrlCrossRef
  34. 34.↵
    Stanke M, Keller O, Gunduz I, Hayes A, Waack S, Morgenstern B: AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic acids research 2006, 34(suppl_2):W435–W439.
    OpenUrlCrossRefPubMedWeb of Science
  35. 35.
    Stanke M, Steinkamp R, Waack S, Morgenstern B: AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic acids research 2004, 32(suppl_2):W309–W312.
    OpenUrlCrossRefPubMedWeb of Science
  36. 36.↵
    Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. Journal of molecular biology 1990, 215(3):403–410.
    OpenUrlCrossRefPubMedWeb of Science
  37. 37.↵
    Suzek BE, Wang Y, Huang H, McGarvey PB, Wu CH, Consortium U: UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches. Bioinformatics 2015, 31(6):926–932.
    OpenUrlCrossRefPubMed
  38. 38.
    Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL: The Pfam protein families database. Nucleic acids research 2004, 32(suppl_1):D138–D141.
    OpenUrlCrossRefPubMedWeb of Science
  39. 39.
    Finn RD, Clements J, Eddy SR: HMMER web server: interactive sequence similarity searching. Nucleic acids research 2011, 39(suppl_2):W29–W37.
    OpenUrlCrossRefPubMedWeb of Science
  40. 40.
    Buchfink B, Xie C, Huson DH: Fast and sensitive protein alignment using DIAMOND. Nature methods 2015, 12(1):59.
    OpenUrl
  41. 41.↵
    Fu L, Niu B, Zhu Z, Wu S, Li W: CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 2012, 28(23):3150–3152.
    OpenUrlCrossRefPubMedWeb of Science
  42. 42.↵
    Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, Billis K, Cummins C, Gall A, Girón CG: Ensembl 2018. Nucleic acids research 2018, 46(D1):D754–D761.
    OpenUrlCrossRefPubMed
  43. 43.↵
    Emms DM, Kelly S: OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome biology 2019, 20(1):1–14.
    OpenUrlCrossRef
  44. 44.↵
    Laetsch DR, Blaxter ML: KinFin: software for Taxon-Aware analysis of clustered protein sequences. G3: Genes, Genomes, Genetics 2017, 7(10):3349–3357.
    OpenUrl
  45. 45.↵
    Katoh K, Standley DM: MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular biology and evolution 2013, 30(4):772–780.
    OpenUrlCrossRefPubMedWeb of Science
  46. 46.↵
    Stamatakis A: RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30(9):1312–1313.
    OpenUrlCrossRefPubMedWeb of Science
  47. 47.↵
    Jombart T, Dray S, Dray MS: Package ‘adephylo’. 2017.
  48. 48.↵
    Jombart T, Dray S: adephylo: exploratory analyses for the phylogenetic comparative method. Bioinformatics 2010, 26(15):1–21.
    OpenUrlCrossRefPubMedWeb of Science
  49. 49.↵
    Yang Z: PAML 4: phylogenetic analysis by maximum likelihood. Molecular biology and evolution 2007, 24(8):1586–1591.
    OpenUrlCrossRefPubMedWeb of Science
  50. 50.↵
    Liu K, Warnow TJ, Holder MT, Nelesen SM, Yu J, Stamatakis AP, Linder CR: SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees. Systematic biology 2012, 61(1):90.
    OpenUrlCrossRefPubMed
  51. 51.↵
    Rice P, Longden I, Bleasby A: EMBOSS: the European molecular biology open software suite. In.: Elsevier current trends; 2000.
  52. 52.↵
    Ng PC, Henikoff S: SIFT: Predicting amino acid changes that affect protein function. Nucleic acids research 2003, 31(13):3812–3814.
    OpenUrlCrossRefPubMedWeb of Science
  53. 53.↵
    Boutet E, Lieberherr D, Tognolli M, Schneider M, Bairoch A: Uniprotkb/swiss-prot. In: Plant bioinformatics. Springer; 2007: 89–112.
  54. 54.↵
    Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Jensen LJ, Von Mering C, Bork P: Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Molecular biology and evolution 2017, 34(8):2115–2122.
    OpenUrlCrossRef
  55. 55.↵
    Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M: KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic acids research 2007, 35(suppl_2):W182–W185.
    OpenUrlCrossRefPubMedWeb of Science
  56. 56.↵
    Liao Y, Wang J, Jaehnig EJ, Shi Z, Zhang B: WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic acids research 2019, 47(W1):W199–W205.
    OpenUrlCrossRef
  57. 57.↵
    Szklarczyk D, Morris JH, Cook H, Kuhn M, Wyder S, Simonovic M, Santos A, Doncheva NT, Roth A, Bork P: The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible. Nucleic acids research 2016:gkw937.
  58. 58.↵
    Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome research 2003, 13(11):2498–2504.
    OpenUrlAbstract/FREE Full Text
  59. 59.↵
    Dolezel J, Bartos J, Voglmayr H, Greilhuber J: Nuclear DNA content and genome size of trout and human. Cytometry Part A: the journal of the International Society for Analytical Cytology 2003, 51(2):127–128; author reply 129.
    OpenUrl
  60. 60.↵
    Nystedt B, Street NR, Wetterbom A, Zuccolo A, Lin Y-C, Scofield DG, Vezzi F, Delhomme N, Giacomello S, Alexeyenko A: The Norway spruce genome sequence and conifer genome evolution. nature 2013, 497(7451):579–584.
    OpenUrlCrossRefPubMedWeb of Science
  61. 61.
    Neale DB, Wegrzyn JL, Stevens KA, Zimin AV, Puiu D, Crepeau MW, Cardeno C, Koriabine M, Holtz-Morris AE, Liechty JD: Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies. Genome biology 2014, 15(3):R59.
    OpenUrlCrossRefPubMed
  62. 62.
    Stevens KA, Wegrzyn JL, Zimin A, Puiu D, Crepeau M, Cardeno C, Paul R, Gonzalez-Ibeas D, Koriabine M, Holtz-Morris AE: Sequence of the sugar pine megagenome. Genetics 2016, 204(4):1613–1626.
    OpenUrlAbstract/FREE Full Text
  63. 63.↵
    Birol I, Raymond A, Jackman SD, Pleasance S, Coope R, Taylor GA, Yuen MMS, Keeling CI, Brand D, Vandervalk BP: Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data. Bioinformatics 2013, 29(12):1492–1497.
    OpenUrlCrossRefPubMedWeb of Science
  64. 64.↵
    Dunemann F, Schrader O, Budahn H, Houben A: Characterization of centromeric histone H3 (CENH3) variants in cultivated and wild carrots (Daucus sp.). Plos one 2014, 9(6):e98504.
    OpenUrl
  65. 65.↵
    Wang L, Deng L: GmACP expression is decreased in GmNORK knockdown transgenic soybean roots. The Crop Journal 2016, 4(6):509–516.
    OpenUrl
  66. 66.↵
    Silvera K, Winter K, Rodriguez BL, Albion RL, Cushman JC: Multiple isoforms of phospho enol pyruvate carboxylase in the Orchidaceae (subtribe Oncidiinae): implications for the evolution of crassulacean acid metabolism. Journal of experimental botany 2014, 65(13):3623–3636.
    OpenUrlCrossRefPubMedWeb of Science
  67. 67.↵
    Fei X, Hou L, Shi J, Yang T, Liu Y, Wei A: Patterns of Drought Response of 38 WRKY Transcription Factors of Zanthoxylum bungeanum Maxim. International journal of molecular sciences 2019, 20(1):68.
    OpenUrl
  68. 68.
    Zhao Y, Cheng X, Liu X, Wu H, Bi H, Xu H: The wheat MYB transcription factor TaMYB31 is involved in drought stress responses in Arabidopsis. Frontiers in plant science 2018, 9:1426.
    OpenUrl
  69. 69.↵
    Waseem M, Li Z: Dissecting the Role of a Basic Helix-Loop-Helix Transcription Factor, SlbHLH22, Under Salt and Drought Stresses in Transgenic Solanum lycopersicum L. Frontiers in plant science 2019, 10:734.
    OpenUrl
  70. 70.↵
    Zhao C, Haigh AM, Holford P, Chen Z-H: Roles of chloroplast retrograde signals and ion transport in plant drought tolerance. International journal of molecular sciences 2018, 19(4):963.
    OpenUrl
  71. 71.↵
    Yoo Y-H, Hong W-J, Jung K-H: A Systematic view exploring the role of chloroplasts in plant abiotic stress responses. BioMed research international 2019, 2019.
  72. 72.↵
    Tuteja N, Mahajan S: Calcium signaling network in plants: an overview. Plant signaling & behavior 2007, 2(2):79–85.
    OpenUrl
  73. 73.
    Takatsuji H: Zinc-finger transcription factors in plants. Cellular and Molecular Life Sciences CMLS 1998, 54(6):582–596.
    OpenUrl
  74. 74.↵
    Agarwal P, Jha B: Transcription factors in plants and ABA dependent and independent abiotic stress signalling. Biologia Plantarum 2010, 54(2):201–212.
    OpenUrl
  75. 75.↵
    Roldán-Arjona T, Ariza RR: Repair and tolerance of oxidative DNA damage in plants. Mutation Research/Reviews in Mutation Research 2009, 681(2-3):169–179.
    OpenUrl
  76. 76.
    Naik PM, Al-Khayri JM: Abiotic and biotic elicitors–role in secondary metabolites production through in vitro culture of medicinal plants. Abiotic and biotic stress in plants-recent advances and future perspectives 2016:247–277.
  77. 77.↵
    Yang Y, Wang W, Chu Z, Zhu J-K, Zhang H: Roles of nuclear pores and nucleo-cytoplasmic trafficking in plant stress responses. Frontiers in plant science 2017, 8:574.
    OpenUrl
  78. 78.↵
    Nisa M-U, Huang Y, Benhamed M, Raynaud C: The plant DNA damage response: Signaling pathways leading to growth inhibition and putative role in response to stress conditions. Frontiers in plant science 2019, 10.
  79. 79.↵
    Arasimowicz M, Floryszak-Wieczorek J: Nitric oxide as a bioactive signalling molecule in plant stress responses. Plant science 2007, 172(5):876–887.
    OpenUrlCrossRefWeb of Science
  80. 80.↵
    Liu Y, Wu R, Wan Q, Xie G, Bi Y: Glucose-6-phosphate dehydrogenase plays a pivotal role in nitric oxide-involved defense against oxidative stress under salt stress in red kidney bean roots. Plant and Cell Physiology 2007, 48(3):511–522.
    OpenUrlCrossRefPubMedWeb of Science
  81. 81.↵
    Bray EA: Molecular responses to water deficit. Plant physiology 1993, 103(4):1035.
    OpenUrlPubMedWeb of Science
  82. 82.↵
    Feng R-J, Ren M-Y, Lu L-F, Peng M, Guan X, Zhou D-B, Zhang M-Y, Qi D-F, Li K, Tang W: Involvement of abscisic acid-responsive element-binding factors in cassava (Manihot esculenta) dehydration stress response. Scientific reports 2019, 9(1):1–12.
    OpenUrl
  83. 83.↵
    Yamaguchi-Shinozaki K, Shinozaki K: Transcriptional regulatory networks in cellular responses and tolerance to dehydration and cold stresses. Annu Rev Plant Biol 2006, 57:781–803.
    OpenUrlCrossRefPubMedWeb of Science
  84. 84.↵
    Van Houtte H, Vandesteene L, López-Galvis L, Lemmens L, Kissel E, Carpentier S, Feil R, Avonce N, Beeckman T, Lunn JE: Overexpression of the trehalase gene AtTRE1 leads to increased drought stress tolerance in Arabidopsis and is involved in abscisic acid-induced stomatal closure. Plant physiology 2013, 161(3):1158–1171.
    OpenUrlAbstract/FREE Full Text
  85. 85.↵
    Teeri J, Tonsor S, Turner M: Leaf thickness and carbon isotope composition in the Crassulaceae. Oecologia 1981, 50(3):367–369.
    OpenUrlCrossRefWeb of Science
  86. 86.↵
    Mallona I, Egea-Cortines M, Weiss J: Conserved and divergent rhythms of crassulacean acid metabolism-related and core clock gene expression in the cactus Opuntia ficus-indica. Plant physiology 2011, 156(4):1978–1989.
    OpenUrlAbstract/FREE Full Text
  87. 87.↵
    Silvera K, Neubig KM, Whitten WM, Williams NH, Winter K, Cushman JC: Evolution along the crassulacean acid metabolism continuum. Functional Plant Biology 2010, 37(11):995–1010.
    OpenUrl
  88. 88.↵
    Santner A, Estelle M: Recent advances and emerging trends in plant hormone signalling. nature 2009, 459(7250):1071–1078.
    OpenUrlCrossRefPubMedWeb of Science
  89. 89.↵
    Cao Y, Luo Q, Tian Y, Meng F: Physiological and proteomic analyses of the drought stress response in Amygdalus Mira (Koehne) Yü et Lu roots. BMC plant biology 2017, 17(1):53.
    OpenUrl
  90. 90.↵
    Janiak A, Kwasniewski M, Sowa M, Gajek K, Żmuda K, Kościelniak J, Szarejko I: No time to waste: Transcriptome study reveals that drought tolerance in barley may be attributed to stressed-like expression patterns that exist before the occurrence of stress. Frontiers in plant science 2018, 8:2212.
    OpenUrl
  91. 91.↵
    Robles P, Quesada V: Transcriptional and post-transcriptional regulation of organellar gene expression (OGE) and its roles in plant salt tolerance. International journal of molecular sciences 2019, 20(5):1056.
    OpenUrl
  92. 92.↵
    Leister D, Wang L, Kleine T: Organellar gene expression and acclimation of plants to environmental stress. Frontiers in plant science 2017, 8:387.
    OpenUrl
  93. 93.↵
    Iqbal MJ: Role of osmolytes and antioxidant enzymes for drought tolerance in wheat. Global Wheat Production 2018:51.
  94. 94.↵
    Jarzyniak KM, Jasiński M: Membrane transporters and drought resistance–a complex issue. Frontiers in plant science 2014, 5:687.
    OpenUrl
  95. 95.↵
    Ranjan A, Sawant S: Genome-wide transcriptomic comparison of cotton (Gossypium herbaceum) leaf and root under drought stress. 3 Biotech 2015, 5(4):585–596.
    OpenUrl
  96. 96.
    Liu WC, Li YH, Yuan HM, Zhang BL, Zhai S, Lu YT: WD40-REPEAT 5a functions in drought stress tolerance by regulating nitric oxide accumulation in Arabidopsis. Plant, cell & environment 2017, 40(4):543–552.
    OpenUrl
  97. 97.↵
    Feyissa BA, Arshad M, Gruber MY, Kohalmi SE, Hannoufa A: The interplay between miR156/SPL13 and DFR/WD40–1 regulate drought tolerance in alfalfa. BMC plant biology 2019, 19(1):1–19.
    OpenUrl
  98. 98.↵
    Tian F, Gong J, Zhang J, Zhang M, Wang G, Li A, Wang W: Enhanced stability of thylakoid membrane proteins and antioxidant competence contribute to drought stress resistance in the tasg1 wheat stay-green mutant. Journal of experimental botany 2013, 64(6):1509–1520.
    OpenUrlCrossRefPubMedWeb of Science
  99. 99.↵
    Shinozaki K, Yamaguchi-Shinozaki K: Gene networks involved in drought stress response and tolerance. Journal of experimental botany 2007, 58(2):221–227.
    OpenUrlCrossRefPubMedWeb of Science
  100. 100.↵
    Tiwari S, Lata C, Singh Chauhan P, Prasad V, Prasad M: A functional genomic perspective on drought signalling and its crosstalk with phytohormone-mediated signalling pathways in plants. Current genomics 2017, 18(6):469–482.
    OpenUrl
  101. 101.↵
    Monroe JG, Powell T, Price N, Mullen JL, Howard A, Evans K, Lovell JT, McKay JK: Drought adaptation in Arabidopsis thaliana by extensive genetic loss-of-function. Elife 2018, 7:e41038.
    OpenUrlCrossRef
  102. 102.↵
    Song K, Kim HC, Shin S, Kim K-H, Moon J-C, Kim JY, Lee B-M: Transcriptome analysis of flowering time genes under drought stress in maize leaves. Frontiers in plant science 2017, 8:267.
    OpenUrl
  103. 103.↵
    Huerta C, Freire M, Cardemil L: Expression of hsp70, hsp100 and ubiquitin in Aloe barbadensis Miller under direct heat stress and under temperature acclimation conditions. Plant cell reports 2013, 32(2):293–307.
    OpenUrl
  104. 104.↵
    Hazrati S, Tahmasebi-Sarvestani Z, Mokhtassi-Bidgoli A, Modarres-Sanavy SAM, Mohammadi H, Nicola S: Effects of zeolite and water stress on growth, yield and chemical compositions of Aloe vera L. Agricultural Water Management 2017, 181:66–72.
    OpenUrl
  105. 105.↵
    Ming R, VanBuren R, Wai CM, Tang H, Schatz MC, Bowers JE, Lyons E, Wang M-L, Chen J, Biggers E: The pineapple genome and the evolution of CAM photosynthesis. Nature genetics 2015, 47(12):1435–1442.
    OpenUrlCrossRefPubMed
  106. 106.↵
    Yin H, Guo H-B, Weston DJ, Borland AM, Ranjan P, Abraham PE, Jawdy SS, Wachira J, Tuskan GA, Tschaplinski TJ: Diel rewiring and positive selection of ancient plant proteins enabled evolution of CAM photosynthesis in Agave. BMC genomics 2018, 19(1):588.
    OpenUrlCrossRef
  107. 107.↵
    Hartwell J: The circadian clock in CAM plants. Annual Plant Reviews online 2018:211–236.
Back to top
PreviousNext
Posted May 31, 2020.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
The genome sequence of Aloe vera reveals adaptive evolution of drought tolerance mechanisms
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
The genome sequence of Aloe vera reveals adaptive evolution of drought tolerance mechanisms
Shubham K. Jaiswal, Abhisek Chakraborty, Shruti Mahajan, Sudhir Kumar, Vineet K. Sharma
bioRxiv 2020.05.29.122895; doi: https://doi.org/10.1101/2020.05.29.122895
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
The genome sequence of Aloe vera reveals adaptive evolution of drought tolerance mechanisms
Shubham K. Jaiswal, Abhisek Chakraborty, Shruti Mahajan, Sudhir Kumar, Vineet K. Sharma
bioRxiv 2020.05.29.122895; doi: https://doi.org/10.1101/2020.05.29.122895

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4684)
  • Biochemistry (10361)
  • Bioengineering (7681)
  • Bioinformatics (26339)
  • Biophysics (13534)
  • Cancer Biology (10692)
  • Cell Biology (15445)
  • Clinical Trials (138)
  • Developmental Biology (8500)
  • Ecology (12824)
  • Epidemiology (2067)
  • Evolutionary Biology (16867)
  • Genetics (11401)
  • Genomics (15483)
  • Immunology (10619)
  • Microbiology (25223)
  • Molecular Biology (10225)
  • Neuroscience (54479)
  • Paleontology (402)
  • Pathology (1669)
  • Pharmacology and Toxicology (2897)
  • Physiology (4345)
  • Plant Biology (9251)
  • Scientific Communication and Education (1586)
  • Synthetic Biology (2558)
  • Systems Biology (6781)
  • Zoology (1466)