VoCompr (status: POC)
VOCabulary-based COMPRession algorithm
Codec specialized in the compression of texts having a very small disctinct characters set.
Install
pip install vocompr
(or pip install git+https://github.com/enzobnl/vocompr.git
)
Usage
from vocompr import vocompr, unvocompr
with open("path/vopress_input.txt", "r") as input_file:
input_str = input_file.read()
with open("path/output.vocompr", "wb") as output_bytes_file:
output_bytes_file.write(vocompr(input_str))
with open("path/output.vocompr", "rb") as input_bytes_file:
print("original text:", unvocompr(input_bytes_file.read()))
Author
Enzo Bonnal (enzobonnal@gmail.com)