cutter-ng

Cutter is rule-based multilingual tokenizer that can be adapted to particular text types.


Licenses
LGPL-2.0/LGPL-3.0
Install
pip install cutter-ng==2.4.post1