Prediction Intervals for Trees using Conformal Intervals
Introduction
The basic idea of inductive conformal intervals is to use a calibration set to learn a given quantile of the error distribution on that set. This quantile is used as the basis for prediction intervals on new data.
However this is often not especially useful in practice as every new prediction will recieve the same interval. Instead we want to scale this interval according to the input data. Intuitively we want to increase the interval where we have less confidence about the data and associated prediction and decrease it where we have more confidence.
In order to produce a scaling factor value captures the confidence or familiarity we have with some data compared to our calibration set, pitci
uses the number of training data rows that fell into the specific leaf nodes that were visited in making the prediction, summed across all trees.
For a full list of the supported libraries and more detail on the methods implmeneted, see the docs.
Install
The easiest way to get pitci
is directly from PyPI using;
pip install pitci
Documentation
The documentation for pitci
can be found here.
For information on how to build the documentation locally see the docs README.
Examples
There are various example notebooks demonstrating how to use the package in the examples folder in the repo.
Build
pitci
uses flit as the package build tool.
To install pitci
for development, use the following commands from the root directory;
pip install "flit>=2,<4"
flit install
The default deps
flag for flit
is all
so this will install all of the libraries required for testing and creating the docs.
To install pitci
in editable mode (i.e. the equivalent of pip install . -e
) use the symlink
flag;
flit install --symlink
See the flit docs for all the command line options for flit
.