string_set_matcher

Determine the best fuzzy match between sets of strings


Keywords
set, string, fuzzy, matching
License
MIT
Install
pip install string_set_matcher==0.1.0

Documentation

String Set Matcher

String Set Matcher is a simple toolkit for finding the highest-scoring mapping between two sets of strings. It can be used to map headers between CSV-style data files automatically, or for similar purposes.

Requirements

Installation

pip install string_set_matcher

Usage

>>> import string_set_matcher as ssm

match

>>> ssm.match(frm, onto)

where frm and onto must both be lists of Python strings.
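Since both arguments have to be plain lists of strings, it can help to normalise the inputs first. The following is a minimal sketch for the CSV header use case, using only the standard library; `read_header` is a hypothetical helper and is not part of string_set_matcher.

```python
import csv
import io


def read_header(csv_text):
    """Return the first row of some CSV text as a list of str.

    Hypothetical helper for illustration; not part of string_set_matcher.
    """
    reader = csv.reader(io.StringIO(csv_text))
    # csv.reader already yields lists of strings; strip stray whitespace.
    return [cell.strip() for cell in next(reader)]


# Two files describing the same columns under different header names.
frm = read_header("name,email,phone\nAda,ada@example.com,555-0100\n")
onto = read_header("Phone Number,Full Name,E-mail\n")
```

The resulting frm and onto are ordinary lists of strings, which is the input shape described above.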

StringSetMatcher scores every possible permutation of frm against onto and returns the results as a list of (mapping, score) pairs in a

[ (mapping, score) ]

format, where "mapping" is an array such that

mapping[i]

is the index in onto to which the ith element of frm was mapped.
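To make the result format concrete, here is a rough, self-contained sketch of the same idea. It is not the string_set_matcher implementation: the scoring function (difflib.SequenceMatcher), the best-first sort order, and the names `similarity` and `brute_force_match` are all assumptions made for illustration.

```python
from difflib import SequenceMatcher
from itertools import permutations


def similarity(a, b):
    """Case-insensitive similarity ratio in [0, 1] (assumed scoring function)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def brute_force_match(frm, onto):
    """Score every assignment of frm elements to distinct positions in onto.

    Returns [(mapping, score)] sorted best-first, where mapping[i] is the
    index in onto assigned to frm[i]. Hypothetical helper, not the
    library's own code.
    """
    if len(frm) > len(onto):
        raise ValueError("this sketch requires len(frm) <= len(onto)")
    results = []
    for mapping in permutations(range(len(onto)), len(frm)):
        score = sum(similarity(frm[i], onto[j]) for i, j in enumerate(mapping))
        results.append((list(mapping), score))
    results.sort(key=lambda pair: pair[1], reverse=True)
    return results


frm = ["name", "email", "phone"]
onto = ["Phone Number", "Full Name", "E-mail"]

# Take the top-scoring mapping and turn it into a frm -> onto lookup.
best_mapping, best_score = brute_force_match(frm, onto)[0]
header_map = {frm[i]: onto[j] for i, j in enumerate(best_mapping)}
```

Here best_mapping[i] is read exactly as described above: it is the position in onto chosen for frm[i]. Note that trying every permutation grows factorially with the number of strings, so a brute-force approach like this one is only practical for small sets.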