Abstract
Background The tufted duck is a non-model organism that suffers high mortality in highly pathogenic avian influenza out-breaks. It belongs to the same bird family (Anatidae) as the mallard, one of the best-studied natural hosts of low-pathogenic avian influenza viruses. Studies in non-model bird species are crucial to disentangle the role of the host response in avian influenza virus infection in the natural reservoir. Such endeavour requires a high-quality genome assembly and transcriptome.
Results This study presents the first high-quality, chromosome-level reference genome assembly of the tufted duck using the Vertebrate Genomes Project pipeline. We sequenced RNA (cDNA) from brain, ileum, lung, ovary, spleen and testis using Illumina short-read and PacBio long-read sequencing platforms, which was used for annotation. We found 34 autosomes plus Z and W sex chromosomes in the curated genome assembly, with 99.6% of the sequence assigned to chromosomes. Functional annotation revealed 14,099 protein-coding genes that generate 111,934 transcripts, which implies an average of 7.9 isoforms per gene. We also identified 246 small RNA families.
Conclusions This annotated genome contributes to continuing research into the host response in avian influenza virus infections in a natural reservoir. Our findings from a comparison between short-read and long-read reference transcriptomics contribute to a deeper understanding of these competing options. In this study, both technologies complemented each other. We expect this annotation to be a foundation for further comparative and evolutionary genomic studies, including many waterfowl relatives with differing susceptibilities to the avian influenza virus.
Competing Interest Statement
The authors have declared no competing interest.
Abbreviations
- AIV
- Avian influenza A virus |
- BSL
- Biosafety level |
- CLR
- Continuous long reads |
- DLS
- Direct label and stain |
- FLNC
- Full-length, non-chimeric reads |
- HPAI
- Highly pathogenic avian influenza |
- NGS
- Next-generation sequencing |
- ORF
- Open reading frame |
- SMRT
- Single-molecule, real-time |
- TAMA
- Transcriptome annotation by modular algorithms |
- VGL
- Vertebrate Genomes Lab |
- VGP
- Vertebrate Genomes Project |
- ZMW
- Zero-mode waveguide