simpleppt

Python implementation of SimplePPT algorithm, with GPU acceleration


License
BSD-3-Clause
Install
pip install simpleppt==1.1.4

Documentation

PyPI Documentation Status Build & Test codecov GitHub license Code style: black

SimplePPT

Python implementation of SimplePPT algorithm, with GPU acceleration.

Please cite the following paper if you use it:

Mao et al. (2015), SimplePPT: A simple principal tree algorithm, SIAM International Conference on Data Mining.

Installation

pip install -U simpleppt

Usage

from sklearn.datasets import make_classification
import simpleppt

X1, Y1 = make_classification(n_features=2, n_redundant=0, n_informative=2,
                             n_clusters_per_class=1, n_classes=3)

ppt=simpleppt.ppt(X1,Nodes=30,seed=1,progress=False,lam=10)
simpleppt.project_ppt(ppt,X1, c=Y1)

result

GPU dependencies (optional)

If you have a nvidia GPU, simpleppt can leverage CUDA computations for speedup in tree inference. The latest version of rapids framework is required (at least 0.17) it is recommanded to create a new conda environment:

conda create -n SimplePPT-gpu -c rapidsai -c nvidia -c conda-forge -c defaults \
    rapids=0.19 python=3.8 cudatoolkit=11.0 -y
conda activate SimplePPT-gpu
pip install simpleppt