primetext

package for indexing text datasets using prime number factorisation for fast word frequency analysis


Keywords
prime, factor, text, word, frequency, search, indexing
License
Other
Install
pip install primetext==0.1

Documentation

primetext

python package for indexing text datasets for fast word frequency analysis

Usage

from primetext import primetext

data = ["black cat on mat",
"black hat for you",
"cat sat on you"]

# initiate primetext
pt = primetext.primetext()

# indexing data
pt.index(data)

# finding words
recordsWithCat = pt.find(['cat'])
# returns boolean vector : [True,False,True]

recordsWithCatAndSat = pt.find(['cat','sat'])
# returns boolean vector : [False,False,True]