MolCompassViewer is a part of the MolCompass project. It is a tool that provides a pretrained parametric t-SNE model for chemical space visualization and the visual validation of QSAR/QSPR models.
The package can be installed: pip install molcompview
To run the endocrine receptor example described in the manuscript (coming soon), you may do so by executing the following steps.
molcompview --demo
Then, wait some time for the calculation of coordinates. After that stage, the browser will open, and you can view the interactive visualization of the chemical space.
To run on your own dataset, just use:
molcompview <input.csv>
MolCompassViewer
intelligently identifies the types of columns within the CSV file, selecting an operational mode based on the presence of specific column types, primarily the molecular structures encoded as SMILES strings.
- STRUCTURE ONLY: Activated when only the SMILES column is identified. Focuses on visualizing molecular structures, omitting additional features like color layers and QSAR/QSPR model analyses.
- ALTERNATIVE: Triggered when additional categorical or numerical columns are found alongside the SMILES column, excluding the Ground Truth and Probabilities columns. Focuses on exploring the chemical space of compounds, which does not apply to the analysis of models. Users can customize point colors in the visualization based on selected properties.
- FULL: Dedicated to the visual analysis of binary QSAR/QSPR models. Available when the CSV file comprises SMILES strings along with Ground Truth and predicted probabilities columns, unlocking access to exclusive features for visualizing binary QSAR/QSPR models.
The applicability domain analysis is possible only in FULL mode for binary classification models. To run this tool, first select a tool from the top left corner of the chemical map, then select an area of interest and release. A new frame will open on the right, displaying statistical parameters and charts calculated exclusively for the selected compounds.