Using video-oriented instructions to speed up sequence comparison

Comput Appl Biosci. 1997 Apr;13(2):145-50. doi: 10.1093/bioinformatics/13.2.145.

Abstract

Motivation: This document presents an implementation of the well-known Smith-Waterman algorithm for comparison of proteic and nucleic sequences, using specialized video instructions. These instructions, SIMD-like in their design, make possible parallelization of the algorithm at the instruction level.

Results: Benchmarks on an ULTRA SPARC running at 167 MHz show a speed-up factor of two compared to the same algorithm implemented with integer instructions on the same machine. Performance reaches over 18 million matrix cells per second on a single processor, giving to our knowledge the fastest implementation of the Smith-Waterman algorithm on a workstation. The accelerated procedure was introduced in LASSAP--a LArge Scale Sequence compArison Package software developed at INRIA--which handles parallelism at higher level. On a SUN Enterprise 6000 server with 12 processors, a speed of nearly 200 million matrix cells per second has been obtained. A sequence of length 300 amino acids is scanned against SWISSPROT R33 (1,8531,385 residues) in 29 s. This procedure is not restricted to databank scanning. It applies to all cases handled by LASSAP (intra- and inter-bank comparisons, Z-score computation, etc.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms
  • Computer Terminals*
  • Computer-Assisted Instruction / methods*
  • Databases, Factual
  • Evaluation Studies as Topic
  • Nucleic Acids / genetics
  • Proteins / genetics
  • Sequence Alignment / methods*
  • Sequence Alignment / statistics & numerical data
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Software

Substances

  • Nucleic Acids
  • Proteins