PULDB: the expanded database of Polysaccharide Utilization Loci

Nucleic Acids Res. 2018 Jan 4;46(D1):D677-D683. doi: 10.1093/nar/gkx1022.

Abstract

The Polysaccharide Utilization Loci (PUL) database was launched in 2015 to present PUL predictions in ∼70 Bacteroidetes species isolated from the human gastrointestinal tract, as well as PULs derived from the experimental data reported in the literature. In 2018 PULDB offers access to 820 genomes, sampled from various environments and covering a much wider taxonomical range. A Krona dynamic chart was set up to facilitate browsing through taxonomy. Literature surveys now allows the presentation of the most recent (i) PUL repertoires deduced from RNAseq large-scale experiments, (ii) PULs that have been subjected to in-depth biochemical analysis and (iii) new Carbohydrate-Active enzyme (CAZyme) families that contributed to the refinement of PUL predictions. To improve PUL visualization and genome browsing, the previous annotation of genes encoding CAZymes, regulators, integrases and SusCD has now been expanded to include functionally relevant protein families whose genes are significantly found in the vicinity of PULs: sulfatases, proteases, ROK repressors, epimerases and ATP-Binding Cassette and Major Facilitator Superfamily transporters. To cope with cases where susCD may be absent due to incomplete assemblies/split PULs, we present 'CAZyme cluster' predictions. Finally, a PUL alignment tool, operating on the tagged families instead of amino-acid sequences, was integrated to retrieve PULs similar to a query of interest. The updated PULDB website is accessible at www.cazy.org/PULDB_new/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism*
  • Bacteroidetes / classification
  • Bacteroidetes / genetics
  • Bacteroidetes / metabolism*
  • Biological Transport / genetics
  • Carrier Proteins / genetics
  • Carrier Proteins / metabolism
  • Chlorobi / classification
  • Chlorobi / genetics
  • Chlorobi / metabolism
  • Databases, Chemical*
  • Databases, Genetic*
  • Energy Metabolism / genetics
  • Enzymes / genetics
  • Enzymes / metabolism
  • Evolution, Molecular
  • Fibrobacteres / classification
  • Fibrobacteres / genetics
  • Fibrobacteres / metabolism
  • Gene Expression Regulation, Bacterial
  • Genes, Bacterial*
  • Molecular Sequence Annotation
  • Multigene Family
  • Operon / genetics*
  • Polysaccharides / metabolism*
  • RNA, Bacterial / genetics
  • Sequence Alignment
  • Species Specificity

Substances

  • Bacterial Proteins
  • Carrier Proteins
  • Enzymes
  • Polysaccharides
  • RNA, Bacterial