Journal of Chemical Information and Modeling 2018-02-22

Optimal HTS Fingerprint Definitions by Using a Desirability Function and a Genetic Algorithm

Alvaro Cortes Cabrera, Paula M. Petrone

DOI: 10.1021/acs.jcim.7b00447


Abstract

Compound biological fingerprints built on data from high-throughput screening (HTS) campaigns, or HTS fingerprints, are a novel cheminformatics method of representing compounds that integrates chemical and biological activity data. The method is gaining momentum in drug discovery applications, including hit expansion, target identification, and virtual screening. HTS fingerprints have two major limitations, noise and missing data, which are intrinsic to high-throughput data acquisition technologies and to the assay availability or assay selection procedure used for their construction. In this work, we present a methodology to define an optimal set of HTS fingerprints by using a desirability function that encodes the principles of maximum biological and chemical space coverage and minimum redundancy between HTS assays. We used a genetic algorithm to optimize the desirability function and obtained an optimal fingerprint, which we evaluated on a test set of 33 diverse assays. Our results show that the optimal HTS fingerprint represents compounds in chemical biology space using 25% fewer assays. When used for virtual screening, the optimal HTS fingerprint matched the performance of full fingerprints, in terms of both area under the curve and enrichment factors, for 27 out of 33 test assays, whereas randomly assembled fingerprints achieved equivalent performance in only 23 test assays.
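The selection idea in the abstract, scoring a subset of assays with a desirability function and searching over subsets with a genetic algorithm, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not the authors' method: the abstract does not give the form of the desirability function, so the score below is a toy one (fraction of compounds covered minus the mean Jaccard overlap between selected assays), the subset size `n_keep` is fixed in advance as a simplification, and the names `desirability`, `genetic_search`, and `toy_activity` are hypothetical.

```python
import random


def desirability(mask, activity, w_redundancy=1.0):
    """Toy score (an assumption, not the paper's function):
    compound coverage minus weighted mean pairwise assay redundancy.

    mask:     list of 0/1 flags, one per assay (1 = assay kept).
    activity: list of sets; activity[j] holds ids of compounds active in assay j.
    """
    chosen = [j for j, keep in enumerate(mask) if keep]
    if not chosen:
        return 0.0
    all_compounds = set().union(*activity)
    covered = set().union(*(activity[j] for j in chosen))
    coverage = len(covered) / len(all_compounds) if all_compounds else 0.0
    # Redundancy: mean Jaccard overlap between active sets of selected assays.
    overlaps = []
    for a in range(len(chosen)):
        for b in range(a + 1, len(chosen)):
            sa, sb = activity[chosen[a]], activity[chosen[b]]
            union = sa | sb
            overlaps.append(len(sa & sb) / len(union) if union else 0.0)
    redundancy = sum(overlaps) / len(overlaps) if overlaps else 0.0
    return coverage - w_redundancy * redundancy


def genetic_search(activity, n_keep, pop_size=30, generations=40,
                   p_mut=0.05, seed=0):
    """Simple elitist genetic algorithm over assay masks with exactly n_keep assays."""
    rng = random.Random(seed)
    n = len(activity)

    def random_mask():
        picked = set(rng.sample(range(n), n_keep))
        return [1 if j in picked else 0 for j in range(n)]

    def repair(mask):
        # Restore exactly n_keep selected assays after crossover and mutation.
        on = [j for j, v in enumerate(mask) if v]
        off = [j for j, v in enumerate(mask) if not v]
        rng.shuffle(on)
        rng.shuffle(off)
        while len(on) > n_keep:
            mask[on.pop()] = 0
        while len(on) < n_keep:
            j = off.pop()
            mask[j] = 1
            on.append(j)
        return mask

    population = [random_mask() for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=lambda m: desirability(m, activity), reverse=True)
        parents = population[:pop_size // 2]  # truncation selection; best masks survive
        children = []
        while len(parents) + len(children) < pop_size:
            p1, p2 = rng.sample(parents, 2)
            child = [rng.choice(pair) for pair in zip(p1, p2)]  # uniform crossover
            child = [1 - g if rng.random() < p_mut else g for g in child]  # bit-flip mutation
            children.append(repair(child))
        population = parents + children
    return max(population, key=lambda m: desirability(m, activity))


# Toy data: assays 0 and 1 are fully redundant; assays 2 and 3 add new compounds.
toy_activity = [{1, 2, 3}, {1, 2, 3}, {4, 5}, {6}]
best = genetic_search(toy_activity, n_keep=3)
print(best, desirability(best, toy_activity))
```

On this toy input the highest-scoring subsets keep only one of the two duplicate assays, which mirrors the stated goal of covering the same space with fewer, less redundant assays. A real application would replace the toy score with a function defined on actual HTS activity data and would evaluate the resulting fingerprint on held-out assays, as the abstract describes.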
