ABRA: improved coding indel detection via assembly-based realignment

Bioinformatics. 2014 Oct;30(19):2813-5. doi: 10.1093/bioinformatics/btu376. Epub 2014 Jun 6.

Abstract

Motivation: Variant detection from next-generation sequencing (NGS) data is an increasingly vital aspect of disease diagnosis, treatment and research. Commonly used NGS-variant analysis tools generally rely on accurately mapped short reads to identify somatic variants and germ-line genotypes. Existing NGS read mappers have difficulty accurately mapping short reads containing complex variation (i.e. more than a single base change), thus making identification of such variants difficult or impossible. Insertions and deletions (indels) in particular have been an area of great difficulty. Indels are frequent and can have substantial impact on function, which makes their detection all the more imperative.

Results: We present ABRA, an assembly-based realigner, which uses an efficient and flexible localized de novo assembly followed by global realignment to more accurately remap reads. This results in enhanced performance for indel detection as well as improved accuracy in variant allele frequency estimation.

Availability and implementation: ABRA is implemented in a combination of Java and C/C++ and is freely available for download at https://github.com/mozack/abra.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Gene Frequency
  • Genome, Human
  • Genotype
  • High-Throughput Nucleotide Sequencing
  • Humans
  • INDEL Mutation*
  • Programming Languages
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Software