TextFlows core text mining module


License
MIT
Install
pip install tf-core==0.3.0

Documentation

TextFlows Core Module

A TextFlows package, which contains the core classes for representing an annotated document corpus, as well as text mining widgets (UI components) based on NLTK. The package can also be used with ClowdFlows 2.0.

Documentation Status

Currently, the project contains several components for text preprocessing: tokenization, stop word removal, lemmatization, part-of-speech tagging, etc.

Installation, documentation

Please find installation instructions, examples and API reference on Read the Docs.

Note

Please note that this is a research project and that drastic changes can be (and are) made pretty regularly. Changes are documented in the CHANGELOG.

Pull requests and issues are welcome.

Contributors to the tf_core package code

Matic Perovšek (@mperice), Matej Martinc (@matejMartinc), Roman Orač (@romanorac)