Abstract
Understanding the structure of sequenced fragments from genomics libraries is essential for accurate read preprocessing. Currently, different assays and sequencing technologies require custom scripts and programs that do not leverage the common structure of sequence elements present in genomics libraries. We present seqspec, a machine-readable specification for libraries produced by genomics assays that facilitates standardization of preprocessing and enables tracking and comparison of genomics assays. The specification and the associated seqspec command-line tool are available at https://github.com/IGVF/seqspec.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
Added a supplemental figure for seqspec print (and a citation for it in the text). Updated functional descriptions for the seqspec CLI. Modified Figure 2 to add seqspec onlist functionality.