Similarity searching using 2D structural fingerprints

Methods Mol Biol. 2011:672:133-58. doi: 10.1007/978-1-60761-839-3_5.

Abstract

This chapter reviews the use of molecular fingerprints for chemical similarity searching. The fingerprints encode the presence of 2D substructural fragments in a molecule, and the similarity between a pair of molecules is a function of the number of fragments that they have in common. Although this provides a very simple way of estimating the degree of structural similarity between two molecules, it has been found to provide an effective and an efficient tool for searching large chemical databases. The review describes the historical development of similarity searching since it was first described in the mid-1980s, reviews the many different coefficients, representations, and weightings that can be combined to form a similarity measure, describes quantitative measures of the effectiveness of similarity searching, and concludes by looking at current developments based on the use of data fusion and machine learning techniques.

Publication types

  • Review

MeSH terms

  • Artificial Intelligence
  • Computational Biology / methods*
  • Computer Simulation
  • Databases, Factual
  • Humans
  • Informatics
  • Molecular Structure
  • Structure-Activity Relationship