A reliable computational workflow for the selection of optimal screening libraries

Yocheved Gilad, Katalin Nadassy, Hanoch Senderowitz

Research output: Contribution to journalArticlepeer-review

20 Scopus citations

Abstract

Background: The experimental screening of compound collections is a common starting point in many drug discovery projects. Successes of such screening campaigns critically depend on the quality of the screened library. Many libraries are currently available from different vendors yet the selection of the optimal screening library for a specific project is challenging. We have devised a novel workflow for the rational selection of project-specific screening libraries. Results: The workflow accepts as input a set of virtual candidate libraries and applies the following steps to each library: (1) data curation; (2) assessment of ADME/T profile; (3) assessment of the number of promiscuous binders/frequent HTS hitters; (4) assessment of internal diversity; (5) assessment of similarity to known active compound(s) (optional); (6) assessment of similarity to in-house or otherwise accessible compound collections (optional). For ADME/T profiling, Lipinski's and Veber's rule-based filters were implemented and a new blood brain barrier permeation model was developed and validated (85 and 74 % success rate for training set and test set, respectively). Diversity and similarity descriptors which demonstrated best performances in terms of their ability to select either diverse or focused sets of compounds from three databases (Drug Bank, CMC and CHEMBL) were identified and used for diversity and similarity assessments. The workflow was used to analyze nine common screening libraries available from six vendors. The results of this analysis are reported for each library providing an assessment of its quality. Furthermore, a consensus approach was developed to combine the results of these analyses into a single score for selecting the optimal library under different scenarios. Conclusions: We have devised and tested a new workflow for the rational selection of screening libraries under different scenarios. The current workflow was implemented using the Pipeline Pilot software yet due to the usage of generic components, it can be easily adapted and reproduced by computational groups interested in rational selection of screening libraries. Furthermore, the workflow could be readily modified to include additional components. This workflow has been routinely used in our laboratory for the selection of libraries in multiple projects and consistently selects libraries which are well balanced across multiple parameters.

Original languageEnglish
Article number61
JournalJournal of Cheminformatics
Volume7
Issue number1
DOIs
StatePublished - 11 Dec 2015

Bibliographical note

Publisher Copyright:
© 2015 Gilad et al.

Funding

This work was supported by a Grant from the ELA Foundation, ELA 2012-013C2 to Y. G. and H. S. Access to the CMC database was generously provided by Accelrys Inc. (http://www.accelrys.com). The Prestwick Chemical Library® was kindly provided by Prestwick Chemical, We are thankful to Dana Honey-cutt from Accelrys Inc. for valuable feedback on this work. We are also thankful to Prof. Alexander Tropsha from the UNC Eshelman School of Pharmacy for many helpful discussions.

FundersFunder number
Accelrys Inc.
ELA FoundationELA 2012-013C2

    Keywords

    • Diversity
    • Fingerprints
    • Library selection
    • QSAR
    • Screening libraries
    • Similarity

    Fingerprint

    Dive into the research topics of 'A reliable computational workflow for the selection of optimal screening libraries'. Together they form a unique fingerprint.

    Cite this