Display options
Share it on

Molecules. 2018 May 23;23(6). doi: 10.3390/molecules23061242.

Substructural Connectivity Fingerprint and Extreme Entropy Machines-A New Method of Compound Representation and Analysis.

Molecules (Basel, Switzerland)

Krzysztof Rataj, Wojciech Czarnecki, Sabina Podlewska, Agnieszka Pocha, Andrzej J Bojarski

Affiliations

  1. Institute of Pharmacology, Polish Academy of Sciences, Department of Medicinal Chemistry, Sm?tna Street 12, 31-343 Kraków, Poland. [email protected].
  2. Faculty of Mathematics and Computer Science, Jagiellonian University, ?ojasiewicza Street 6, 30-348 Kraków, Poland. [email protected].
  3. Institute of Pharmacology, Polish Academy of Sciences, Department of Medicinal Chemistry, Sm?tna Street 12, 31-343 Kraków, Poland. [email protected].
  4. Faculty of Mathematics and Computer Science, Jagiellonian University, ?ojasiewicza Street 6, 30-348 Kraków, Poland. [email protected].
  5. Institute of Pharmacology, Polish Academy of Sciences, Department of Medicinal Chemistry, Sm?tna Street 12, 31-343 Kraków, Poland. [email protected].

PMID: 29789513 PMCID: PMC6100401 DOI: 10.3390/molecules23061242

Abstract

Key-based substructural fingerprints are an important element of computer-aided drug design techniques. The usefulness of the fingerprints in filtering compound databases is invaluable, as they allow for the quick rejection of molecules with a low probability of being active. However, this method is flawed, as it does not consider the connections between substructures. After changing the connections between particular chemical moieties, the fingerprint representation of the compound remains the same, which leads to difficulties in distinguishing between active and inactive compounds. In this study, we present a new method of compound representation-substructural connectivity fingerprints (SCFP), providing information not only about the presence of particular substructures in the molecule but also additional data on substructure connections. Such representation was analyzed by the recently developed methodology-extreme entropy machines (EEM). The SCFP can be a valuable addition to virtual screening tools, as it represents compound structure with greater detail and more specificity, allowing for more accurate classification.

Keywords: fingerprint; machine learning; molecular representation; substructures

References

  1. Bioinformatics. 2008 Nov 1;24(21):2518-25 - PubMed
  2. J Chem Inf Model. 2006 Nov-Dec;46(6):2423-31 - PubMed
  3. J Comput Chem. 2011 May;32(7):1466-74 - PubMed
  4. Bioorg Med Chem Lett. 2014 Jan 15;24(2):580-5 - PubMed
  5. PLoS One. 2013 Apr 16;8(4):e61007 - PubMed
  6. Curr Protein Pept Sci. 2007 Aug;8(4):329-51 - PubMed
  7. Nucleic Acids Res. 2014 Jan;42(Database issue):D1083-90 - PubMed
  8. J Chem Inf Model. 2014 Mar 24;54(3):933-43 - PubMed

Substances

MeSH terms

Publication Types