pandance

Advanced relational operations for pandas DataFrames


Keywords
data-analysis, data-science, data-wrangling, pandas
License
BSD-3-Clause
Install
pip install pandance==0.3.0

Documentation


Pandance


Pandance provides advanced relational operations for pandas DataFrames, enabling powerful and efficient joins (aka merges).

Highlights

Pandance extends the set of standard join operations in pandas (inner, outer, cross, left, right) with:

  • fuzzy joins: Match columns with a tolerance. Supports numerical and datetime values.
  • inequality join: Match one column's values that are less / greater than the other column's values.
  • theta joins: Allows the user to specify arbitrary matching conditions on which to join

Pandance is designed with performance in mind, aiming to provide fast implementations whenever possible.

Installation

pip install pandance

Usage

See the documentation