Save the date! Upstream is on June 5! 🎉

pytokenjoin
Release 0.1.8

pyTokenJoin is a library containing efficient algorithms that solve the set similarity join problem with maximum weighted bipartite matching.

Homepage PyPI Jupyter Notebook

License: Apache-2.0
Install: pip install pytokenjoin==0.1.8

Documentation

pyTokenJoin

Overview

TokenJoin is an efficient method for solving the Fuzzy Set Similarity Join problem. It relies only on tokens and their defined utilities, avoiding pairwise comparisons between elements. It is submitted to the International Conference on Very Large Databases (VLDB). This is the repository for the python source code. More information about the original method can be found here.

Installation

You can easily install pytokenjoin from PyPI using pip:

pip install pytokenjoin

Usage

There are two ways to use TokenJoin:

When using a threshold δ, e.g. δ=0.7
When requesting top-k results, e.g. k=100.

There are also two similarity functions supported: Jaccard and Edit Similarity.

More information on how to use the functions can be found on this jupyter notebook.

Dependencies: 0
Dependent packages: 0
Dependent repositories: 0
Total releases: 9
Latest release: Mar 26, 2024
First release: Sep 4, 2023
Stars: 3
Forks: 0
Watchers: 1
Contributors: 1
Repository size: 981 KB
SourceRank: 8

Source repo 2FA enabled: TEXT!
Package manager 2FA enabled: TEXT!
Is security responsive: TEXT!
Dependencies are managed: TEXT!
Issue-free release available: TEXT!
Succession plan available: TEXT!
Package manager 2FA enabled: TEXT!

Releases

0.1.8: Mar 26, 2024
0.1.7: Mar 12, 2024
0.1.6: Dec 4, 2023
0.1.5: Dec 1, 2023
0.1.4: Sep 4, 2023
0.1.3: Sep 4, 2023
0.1.2: Sep 4, 2023
0.1.1: Sep 4, 2023
0.1.0: Sep 4, 2023

Contributors

See all contributors

Something wrong with this page? Make a suggestion

Export .ABOUT file for this package

Last synced: 2024-03-26 15:55:08 UTC

pytokenjoin
Release 0.1.8

Release 0.1.8

0.1.8

0.1.7

0.1.6

0.1.5

0.1.4

0.1.3

0.1.2

0.1.1

0.1.0

Documentation

pyTokenJoin

Overview

Installation

Usage

Stats

Development practices

Releases

Contributors

pytokenjoin Release 0.1.8

Release 0.1.8 Toggle Dropdown 0.1.8 0.1.7 0.1.6 0.1.5 0.1.4 0.1.3 0.1.2 0.1.1 0.1.0

Documentation

pyTokenJoin

Overview

Installation

Usage

Stats

Development practices

Releases

Contributors

pytokenjoin
Release 0.1.8

Release 0.1.8

0.1.8

0.1.7

0.1.6

0.1.5

0.1.4

0.1.3

0.1.2

0.1.1

0.1.0