outdpik

The fundamental toolkit for outlier search and visualization


Keywords
OUTLIERS, VISUALIZATION, PANDAS, data-analysis, matplotlib, numpy, python
License
GPL-3.0
Install
pip install outdpik==1.5


outdpik: Fundamental toolkit for outlier analysis and visualization


What is it?

Outdpik is an open-source Python package that provides several methods for outlier detection. It aims to be the fundamental high-level package for this purpose. It also offers visualization methods for outlier analysis.

Main Features

Here are just a few of the things that outdpik does well:

  • Support for NumPy arrays and pandas DataFrames
  • Multiple outlier detection techniques that can be combined
  • Powerful visualizations
  • Flexible selection of one or more columns for the analysis
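
To make "techniques that can be combined" concrete, here is a minimal sketch that combines two common detection rules on a NumPy array. It does not use outdpik or reproduce its internals: the helper names `iqr_mask` and `zscore_mask`, the thresholds, and the sample data are all assumptions chosen for illustration.

```python
import numpy as np


def iqr_mask(values, k=1.5):
    """Flag points outside [Q1 - k*IQR, Q3 + k*IQR] (hypothetical helper)."""
    values = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(values, [25, 75])
    iqr = q3 - q1
    return (values < q1 - k * iqr) | (values > q3 + k * iqr)


def zscore_mask(values, threshold=3.0):
    """Flag points whose absolute z-score exceeds the threshold (hypothetical helper)."""
    values = np.asarray(values, dtype=float)
    std = values.std()
    if std == 0:
        # A constant array has no spread, so nothing can be an outlier.
        return np.zeros(values.shape, dtype=bool)
    return np.abs((values - values.mean()) / std) > threshold


# Illustrative data: ten ordinary readings and one extreme value.
data = np.array([10, 11, 12, 11, 10, 12, 11, 10, 12, 11, 95])

# Combine the rules: flagged by either technique, or only by both.
either = iqr_mask(data) | zscore_mask(data)
both = iqr_mask(data) & zscore_mask(data)
```

Taking the union (`|`) is the more permissive choice, while the intersection (`&`) keeps only points that every rule agrees on.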

Where to get it

The source code is currently hosted on GitHub at: https://github.com/DanielPuentee/outdpik

The installer for the latest released version is available on the Python Package Index (PyPI):

# PyPI
pip install outdpik

How to use outdpik 🤙

Examples of configuring and running outdpik:

import outdpik as outdp
outdp = outdp()

Next, detect the outliers. The call returns a dictionary of the numeric features and their outlier instances:

outliers_dict = outdp.outliers(df = df, cols = "all")
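
To show the shape of such a result, the sketch below builds a dictionary from numeric column names to the flagged values using pandas and the interquartile-range rule. It is not outdpik's implementation: the function `iqr_outliers`, its parameters, and the sample DataFrame are assumptions, and the structure outdpik actually returns may differ.

```python
import pandas as pd


def iqr_outliers(df, cols="all", k=1.5):
    """Return {column: values flagged by the IQR rule} (hypothetical helper)."""
    numeric = list(df.select_dtypes(include="number").columns)
    if isinstance(cols, str) and cols == "all":
        selected = numeric
    else:
        # Accept a single column name or a list of names; skip non-numeric ones.
        wanted = [cols] if isinstance(cols, str) else list(cols)
        selected = [c for c in wanted if c in numeric]

    result = {}
    for col in selected:
        q1, q3 = df[col].quantile([0.25, 0.75])
        iqr = q3 - q1
        mask = (df[col] < q1 - k * iqr) | (df[col] > q3 + k * iqr)
        result[col] = df.loc[mask, col]
    return result


# Illustrative data: "x" holds one extreme value, "label" is not numeric.
df = pd.DataFrame(
    {"x": [1, 2, 3, 4, 100], "y": [5, 5, 6, 6, 5], "label": list("abcde")}
)
outliers_dict = iqr_outliers(df, cols="all")
```

Here `outliers_dict["x"]` holds the single flagged value, `outliers_dict["y"]` is empty, and the non-numeric `label` column is skipped.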

Plotting the outliers of a column:

outdp.plot_outliers(df = df, col = "x")

[Figure: strip plot of the detected outliers]
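
For readers who want to see what a strip plot of outliers involves, the following is a rough sketch using matplotlib directly. It is not what `plot_outliers` does internally: the function `strip_plot`, the IQR rule used to flag points, the jitter range, and the colors are all assumptions.

```python
import matplotlib

matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np


def strip_plot(values, k=1.5, seed=0):
    """Draw a jittered strip plot with IQR outliers in red (hypothetical helper)."""
    values = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(values, [25, 75])
    iqr = q3 - q1
    is_outlier = (values < q1 - k * iqr) | (values > q3 + k * iqr)

    # Small vertical jitter keeps overlapping points distinguishable.
    rng = np.random.default_rng(seed)
    jitter = rng.uniform(-0.1, 0.1, size=values.size)

    fig, ax = plt.subplots(figsize=(6, 2.5))
    ax.scatter(values[~is_outlier], jitter[~is_outlier], label="inlier")
    ax.scatter(values[is_outlier], jitter[is_outlier], color="red", label="outlier")
    ax.set_yticks([])
    ax.set_xlabel("x")
    ax.legend()
    return fig, is_outlier


fig, flags = strip_plot([1, 2, 3, 4, 100])
```

Calling `fig.savefig(...)` with a path of your choice writes the figure to disk.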

Dependencies

License

This project is licensed under the terms of the MIT License; see the LICENSE file for details.

Documentation

The official documentation host is still to be defined; the placeholder link is https://pandas.pydata.org/pandas-docs/stable

Development

Want to contribute? Great! Open a discussion on GitHub in this repository and we will answer as soon as possible.