Fast construction of FM-index for long sequence reads

Heng Li

doi:10.1093/bioinformatics/btu541

Fast construction of FM-index for long sequence reads

Bioinformatics. 2014 Nov 15;30(22):3274-5. doi: 10.1093/bioinformatics/btu541. Epub 2014 Aug 8.

Author

Heng Li¹

Affiliation

¹ Medical Population Genetics Program, Broad Institute, 75 Ames Street, Cambridge, MA 02142, USA.

Abstract

Summary: We present a new method to incrementally construct the FM-index for both short and long sequence reads, up to the size of a genome. It is the first algorithm that can build the index while implicitly sorting the sequences in the reverse (complement) lexicographical order without a separate sorting step. The implementation is among the fastest for indexing short reads and the only one that practically works for reads of averaged kilobases in length.

Availability and implementation: https://github.com/lh3/ropebwt2 CONTACT: hengli@broadinstitute.org.

Fast construction of FM-index for long sequence reads

Author

Affiliation

Abstract

Publication types

MeSH terms

Grants and funding