clearmatch
Clearmatch is a package for matching records in one dataset to records in another by means of a key that contains reference records. If a record to be matched is a synonym of a reference record in the key, it is matched with that reference. Clearmatch also makes it easy to view summary statistics and generate bar plots of missingness.
Dependencies: Matplotlib, NumPy, pandas
Installation
Creating ClearMatch objects from DataFrames
Defining the lookup structures for matching
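To illustrate the idea behind a synonym lookup, here is a minimal sketch in plain pandas. It does not use the clearmatch API; the `key` and `host` frames, the column names, and the `lookup` dictionary are all hypothetical and exist only for this example.

```python
import pandas as pd

# Hypothetical key: each row holds a reference record and its known synonyms.
key = pd.DataFrame({
    "reference": ["United States", "United Kingdom"],
    "synonyms": [["USA", "US", "U.S."], ["UK", "Great Britain"]],
})

# Flatten the key into a dictionary that maps every synonym
# (and the reference itself) to its reference record.
lookup = {}
for reference, synonyms in zip(key["reference"], key["synonyms"]):
    lookup[reference] = reference
    for synonym in synonyms:
        lookup[synonym] = reference

# Hypothetical host records to be matched against the key.
host = pd.DataFrame({"country": ["USA", "UK", "France", "United States"]})

# Replace each host value with its reference; values not in the key become NaN.
host["matched"] = host["country"].map(lookup)
```

Values with no entry in the lookup come back as missing, which is one way the unmatched records could be counted or plotted afterwards.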
Partitioning the host DataFrame based on unique values in a given column
*Note that the resulting DataFrames are returned in a dictionary, so use the ['name'] key syntax to access each DataFrame.
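The dictionary-of-DataFrames result can be pictured with a short sketch in plain pandas. This is not the clearmatch method itself; the `host` frame, its `region` column, and the `partitions` name are assumptions made for illustration.

```python
import pandas as pd

# Hypothetical host DataFrame with a column to partition on.
host = pd.DataFrame({
    "region": ["north", "south", "north", "east"],
    "value": [1, 2, 3, 4],
})

# Build one DataFrame per unique value in the column, keyed by that value.
partitions = {
    name: group.reset_index(drop=True)
    for name, group in host.groupby("region")
}

# Access an individual partition by its key.
north = partitions["north"]
```

In this sketch, rows whose partition column is missing are left out, because `groupby` drops NaN keys by default.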