bioRxiv
New Results

A machine-readable specification for genomics assays

A. Sina Booeshaghi, Xi Chen, Lior Pachter
doi: https://doi.org/10.1101/2023.03.17.533215
A. Sina Booeshaghi
1 Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
  • For correspondence: abooesha@caltech.edu, lpachter@caltech.edu
Xi Chen
2 School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
Lior Pachter
1 Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
3 Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California

Abstract

Understanding the structure of sequenced fragments from genomics libraries is essential for accurate read preprocessing. Currently, different assays and sequencing technologies require custom scripts and programs that do not leverage the common structure of sequence elements present in genomics libraries. We present seqspec, a machine-readable specification for libraries produced by genomics assays that facilitates standardization of preprocessing and enables tracking and comparison of genomics assays. The specification and the associated seqspec command-line tool are available at https://github.com/IGVF/seqspec.
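
To illustrate why a machine-readable description of read structure helps preprocessing, the following is a minimal sketch in Python. It is not the seqspec file format or the seqspec tool's API. The `Region` class, the `split_read` function, and the element names and lengths (a 16-base barcode followed by a 12-base UMI) are all hypothetical, chosen only to show how a declarative layout could replace assay-specific parsing code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """One sequence element in a read (hypothetical, not the seqspec schema)."""
    name: str
    length: int


def split_read(sequence: str, layout: list[Region]) -> dict[str, str]:
    """Cut a read into named elements according to a declarative layout."""
    expected = sum(region.length for region in layout)
    if len(sequence) < expected:
        raise ValueError(
            f"read has {len(sequence)} bases, layout needs {expected}"
        )
    parts = {}
    offset = 0
    for region in layout:
        # Each element occupies the next `length` bases of the read.
        parts[region.name] = sequence[offset:offset + region.length]
        offset += region.length
    return parts


# Assumed layout for illustration: barcode first, then UMI.
LAYOUT = [Region("barcode", 16), Region("umi", 12)]

read = "ACGTACGTACGTACGT" + "TTTTGGGGCCCC"
print(split_read(read, LAYOUT))
```

Under these assumptions, supporting a different assay means changing only the `LAYOUT` data and not the parsing function, which is the kind of reuse a shared specification is meant to enable.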

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://github.com/igvf/seqspec

  • https://igvf.github.io/seqspec/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Posted March 21, 2023.
Subject Area

  • Bioinformatics