Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Long-read transcriptome data of bee fungal parasite, Nosema ceranae

Huazhi Chen, Yu Du, Yuanchan Fan, Haibin Jiang, Cuiling Xiong, Yanzhen Zheng, Dafu Chen, Rui Guo
doi: https://doi.org/10.1101/2020.03.11.987271
Huazhi Chen
College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yu Du
College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuanchan Fan
College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Haibin Jiang
College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Cuiling Xiong
College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yanzhen Zheng
College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dafu Chen
College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rui Guo
College of Animal Sciences (College of Bee Science), Fujian Agriculture and Forestry University, Fuzhou 350002, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: ruiguo@fafu.edu.cn
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

ABSTRACT

Nosema ceranae, a widespread fungal parasite that infects honeybee and many other bee species, can seriously affect bee health and colony productivity. In this article, N. ceranae spores were purified followed by third-generation sequencing using Nanopore PromethION platform. Totally, 6988795 raw reads were yielded from purified spores, with a length distribution among 1 kb~10 kb and a quality (Q) score distribution among Q6~Q12. A total of 6953469 clean reads were obtained, and among them 73.98% were identified as being full-length. The length of redundant reads-removed full-length transcripts was ranged from 1 kb to 5 kb, with the most abundant length of 1 kb. These data will improve transcriptome quality of N. ceranae significantly.

Value of the data

  • The dataset reported here allows a deeper understanding of the complexity of Nosema ceranae transcriptome.

  • The accessible data can benefit researchers in identification of genes associated with N. ceranae infection mechanism.

  • This dataset could be used to improve functional annotations of N. ceranae genome and transcriptome.

  • The deposited data contributes to further exploring alternative splicing and polyadenylation of N. ceranae mRNAs.

Data

The shared long-read transcriptome data were derived from purified N. ceranae spores. Based on Nanopore sequencing, 6988795 raw reads were produced, with an average length of 881 bp and a mean N50 of 971 bp (Table 1). As Fig. 1 shown, these raw reads were ranged from 1 kb to 10 kb in length, with the most abundant length of 1 kb. Additionally, the quality (Q) score of majority of raw reads was Q9 (Fig. 2). A total of 6953469 clean reads were obtained, and among them 73.98% were identified as being full-length (Table 2; Fig. 3). After removing redundant reads, the length of remaining full-length transcripts was distributed among 1 kb~5 kb, with the highest percentage of 1 kb (Fig. 4).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1

Summary of Nanopore raw reads

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2

Summary of full-length clean reads

Fig. 1.
  • Download figure
  • Open in new tab
Fig. 1.

Length distribution of clean reads

Fig. 2.
  • Download figure
  • Open in new tab
Fig. 2.

Quality distribution of clean reads

Fig. 3.
  • Download figure
  • Open in new tab
Fig. 3.

Length distribution of full-length clean reads

Fig. 4.
  • Download figure
  • Open in new tab
Fig. 4.

Length distribution of redundant reads-removed full-length transcripts

Experimental Design, Materials, and Methods

2.1 Purification of N. ceranae spores

N. ceranae spores were previously purified following the method described by Cornman et al. [1] with some minor modifications [2]. A bit of spores was subjected to PCR detection and proved to be mono-specific [3] using previously developed specific primers [4]. The purified spores were immediately frozen in liquid nitrogen and stored at − 80 °C until use.

2.2 RNA isolation, cDNA library construction and Nanopore sequencing

Total RNA of N. ceranae spores was isolated using TRizol reagent (Thermo Fisher, China) on dry ice followed by reverse transcription with Maxima H Minus Reverse Transcriptase, and then purified and concentrated on a AMPure XP beads. cDNA libraries were constructed from 50 ng total RNA with a 14 cycles PCR using the cDNA-PCR Sequencing Kit (SQK-PCS109) and PCR Barcoding Kit (SQK-PBK004) according to the manufacturer’s protocol (Oxford Nanopore Technologies Ltd, UK), followed by sequencing by Biomarker Technologies (China) using PromethION platform (Oxford Nanopore Technologies Ltd, UK).

2.3 Long-read processing

Firstly, raw reads were filtered with minimum average read Q score=7 and minimum read length=500bp; ribosomal RNA were discarded after mapping to rRNA database. Secondly, full-length non-chemiric (FLNC) transcripts were determined by searching for primer at both ends of reads. Thirdly, clusters of FLNC transcripts were obtained after mapping to reference genome of N. ceranae (assembly ASM98816v1) with mimimap2 [5], and consensus isoforms were obtained after polishing within each cluster by pinfish (https://github.com/nanoporetech/pinfish); consensus sequences were mapped to reference genome (assembly ASM98816v1) using minimap2. Fourthly, mapped reads were further collapsed by cDNA_Cupcake package (https://github.com/Magdoll/cDNA_Cupcake) with min-coverage=85% and min-identity=90%; 5’ difference was not considered when collapsing redundant transcripts.

Acknowledgments

This research was supported by the Earmarked Fund for China Agriculture Research System (No. CARS-44-KXJ7), the Science and Technology Planning Project of Fujian Province (No. 2018J05042), the Teaching and Scientific Research Fund of Education Department of Fujian Province (No. JAT170158), the Outstanding Scientific Research Manpower Fund of Fujian Agriculture and Forestry University (No. xjq201814), and the Scientific and Technical Innovation Fund of Fujian Agriculture and Forestry University (No. CXZX2017342, No. CXZX2017343).

Reference

  1. [1].↵
    R.S. Cornman, Y.P. Chen, M.C. Schatz, C. Street, Y. Zhao, B. Desany, M. Egholm, S. Hutchison, J.S. Pettis, W.L. Lipkin, J.D. Evans. (2009) Genomic analyses of the microsporidian Nosema ceranae, an emergent pathogen of honey bees. PLoS Pathog. 5(2009) e1000466.
    OpenUrlCrossRefPubMed
  2. [2].↵
    H.Z. Chen, Y. Du, C.L. Xiong, Y.Z. Zheng, D.F. Chen, R. Guo. A comprehensive transcriptome data of normal and Nosema ceranae -stressed midguts of Apis mellifera ligustica workers. Data in Brief (2019) 26.
  3. [3].↵
    Yu Du, D.D. Zhou, H.Z. Chen, C.L. Xiong, Y.Z. Zheng, D.F. Chen, R. Guo. MicroRNA dataset of normal and Nosema ceranae-infected midguts of Apis cerana cerana workers. Data in Brief (2019) 26.
  4. [4].↵
    Y. Chen, J.D. Evans, I.B. Smith, J.S. Pettis. (2008) Nosema ceranae, is a long-present and wide-spread microsporidian infection of the European honey bee (Apis mellifera) in the United States. J. Invertebr. Pathol. 97(2008) 186–188.
    OpenUrlCrossRefPubMed
  5. [5].↵
    H. Li; Minimap2: pairwise alignment for nucleotide sequences, Bioinformatics, 34(2018) 3094–3100.
    OpenUrlCrossRefPubMed
Back to top
PreviousNext
Posted March 12, 2020.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Long-read transcriptome data of bee fungal parasite, Nosema ceranae
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Long-read transcriptome data of bee fungal parasite, Nosema ceranae
Huazhi Chen, Yu Du, Yuanchan Fan, Haibin Jiang, Cuiling Xiong, Yanzhen Zheng, Dafu Chen, Rui Guo
bioRxiv 2020.03.11.987271; doi: https://doi.org/10.1101/2020.03.11.987271
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Long-read transcriptome data of bee fungal parasite, Nosema ceranae
Huazhi Chen, Yu Du, Yuanchan Fan, Haibin Jiang, Cuiling Xiong, Yanzhen Zheng, Dafu Chen, Rui Guo
bioRxiv 2020.03.11.987271; doi: https://doi.org/10.1101/2020.03.11.987271

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3482)
  • Biochemistry (7329)
  • Bioengineering (5301)
  • Bioinformatics (20212)
  • Biophysics (9985)
  • Cancer Biology (7706)
  • Cell Biology (11273)
  • Clinical Trials (138)
  • Developmental Biology (6425)
  • Ecology (9923)
  • Epidemiology (2065)
  • Evolutionary Biology (13292)
  • Genetics (9353)
  • Genomics (12559)
  • Immunology (7681)
  • Microbiology (18964)
  • Molecular Biology (7421)
  • Neuroscience (40915)
  • Paleontology (298)
  • Pathology (1226)
  • Pharmacology and Toxicology (2130)
  • Physiology (3145)
  • Plant Biology (6842)
  • Scientific Communication and Education (1271)
  • Synthetic Biology (1893)
  • Systems Biology (5299)
  • Zoology (1086)