A filter tool for HTS and VS

python, cheminformatics, rdkit, drug-design, conda-packages, pypi-package, compounds-filter
pip install scopy==1.2.5


Scopy: An integrated negative design Python library for desirable HTS/VS database design

Travis (.com) codecov GitHub last commit Conda PyPI MIT License Blog Kouhai DOI


Scopy (Screnning COmpounds in PYthon), an integrated negative design python library designed for screening out undesirable compounds in the early drug discovery. Scopy includes six modules, covering data preparation, screening filters, the calculation of scaffolds and descriptors, and the visualization analysis.


Install RDKit

>>> conda install -c conda-forge rdkit

Install Scopy

Scopy has been successfully tested on Linux, OSX and Windows systems under Python3 enviroment.


>>> git clone && cd scopy
>>> [sudo] python install


>>> pip install scopy


(1)The online version of the documentation is available here:
(2)Quick start examples:
(3)Application examples(pipelines):


If you have questions or suggestions, please contact:,and
Please see the file LICENSE for details about the "MIT" license which covers this software and its associated data and documents.

Cite us

Yang ZY, Yang ZJ, Lu AP, Hou TJ, Cao DS. Scopy: an integrated negative design python library for desirable HTS/VS database design [published online ahead of print, 2020 Sep 7]. Brief Bioinform. 2020;bbaa194. doi:10.1093/bib/bbaa194

    author = {Yang, Zi-Yi and Yang, Zhi-Jiang and Lu, Ai-Ping and Hou, Ting-Jun and Cao, Dong-Sheng},
    title = "{Scopy: an integrated negative design python library for desirable HTS/VS database design}",
    journal = {Briefings in Bioinformatics},
    year = {2020},
    month = {09},
    abstract = "{High-throughput screening (HTS) and virtual screening (VS) have been widely used to identify potential hits from large chemical libraries. However, the frequent occurrence of ‘noisy compounds’ in the screened libraries, such as compounds with poor drug-likeness, poor selectivity or potential toxicity, has greatly weakened the enrichment capability of HTS and VS campaigns. Therefore, the development of comprehensive and credible tools to detect noisy compounds from chemical libraries is urgently needed in early stages of drug discovery.In this study, we developed a freely available integrated python library for negative design, called Scopy, which supports the functions of data preparation, calculation of descriptors, scaffolds and screening filters, and data visualization. The current version of Scopy can calculate 39 basic molecular properties, 3 comprehensive molecular evaluation scores, 2 types of molecular scaffolds, 6 types of substructure descriptors and 2 types of fingerprints. A number of important screening rules are also provided by Scopy, including 15 drug-likeness rules (13 drug-likeness rules and 2 building block rules), 8 frequent hitter rules (four assay interference substructure filters and four promiscuous compound substructure filters), and 11 toxicophore filters (five human-related toxicity substructure filters, three environment-related toxicity substructure filters and three comprehensive toxicity substructure filters). Moreover, this library supports four different visualization functions to help users to gain a better understanding of the screened data, including basic feature radar chart, feature-feature-related scatter diagram, functional group marker gram and cloud gram.Scopy provides a comprehensive Python package to filter out compounds with undesirable properties or substructures, which will benefit the design of high-quality chemical libraries for drug design and discovery. It is freely available at}",
    issn = {1477-4054},
    doi = {10.1093/bib/bbaa194},
    url = {},
    note = {bbaa194},
    eprint = {},


Thanks to my colleague, Ziyi, for assisting me to complete the writing of document and article.