Genome-wide identification and initial characterization of bovine long non-coding RNAs from EST data

Anim Genet. 2012 Dec;43(6):674-82. doi: 10.1111/j.1365-2052.2012.02325.x. Epub 2012 Feb 8.

Abstract

It has become increasingly clear that mammalian genomes produce many long non-coding RNAs (lncRNAs). Accumulating evidence suggests important functions for lncRNAs in a variety of biological processes. However, little is known about lncRNA identity and characteristics in cattle. Using public bovine-specific expressed sequence tag (EST) sequences, we reconstructed transcript assemblies, from which reference sequences were obtained for RNAs. Intergenic regions with evidence of transcription were screened for putative lncRNAs using a combination of a gene-finding program and a support vector machine-based tool for the calculation of protein-coding potential. A total of 449 putative lncRNAs located in 405 intergenic regions were identified. Characterization of these putative bovine lncRNAs suggests that they are generally expressed in a tissue-specific manner, that their GC contents are higher than those of randomly selected intergenic sequences but lower than those of protein-coding genes, and that they are moderately conserved among mammals. This is the first genome-wide catalogue of putative intergenic lncRNAs in cattle and provides important targets for functional studies.
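
The GC-content comparison mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names `gc_content` and `mean_gc` are hypothetical, and the short sequences are invented toy data, not taken from the study. The sketch only shows how a mean GC fraction per sequence class might be computed before the classes are compared.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among the unambiguous A/C/G/T bases."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at  # ambiguous bases such as N are ignored
    return gc / total if total else 0.0


def mean_gc(seqs) -> float:
    """Return the mean GC fraction over a collection of sequences."""
    values = [gc_content(s) for s in seqs]
    return sum(values) / len(values) if values else 0.0


# Toy sequences invented for illustration only (assumption, not study data).
toy_classes = {
    "random_intergenic": ["ATATTTAAGC", "TTAACGATAT"],
    "putative_lncRNA": ["ATGCGCTTAA", "GGCATTACGA"],
    "protein_coding": ["GCGCCGATGC", "CCGGATGCGC"],
}

for label, seqs in toy_classes.items():
    print(f"{label}: mean GC = {mean_gc(seqs):.2f}")
```

In a real analysis the sequences would be read from FASTA files, and the class means would be compared with an appropriate statistical test rather than by inspection.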

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Base Composition
  • Base Sequence
  • Cattle / genetics*
  • DNA, Intergenic / genetics*
  • Expressed Sequence Tags*
  • Genetic Variation
  • Genome
  • Polymorphism, Single Nucleotide
  • RNA, Long Noncoding / genetics*
  • Sequence Alignment / veterinary

Substances

  • DNA, Intergenic
  • RNA, Long Noncoding