Annotates corpuses for use in FastText over a Command Line Interface.
π
Raison D'Γͺtre This allows a large corpus to be annotated easily using a command line interface for purposes of being used in a FastText supervised classifier.
π
Architecture fasttext-annotator
has a very simple architecture that basically reads each line of the file fed to it, and asks the user for an annotation given the options provided to it via the command line.
π
Dependencies
π₯
Installation You can install this via the PyPi package manager, or manually.
Manually
- Run
python setup.py --install
Via Pypi
- Run
pip install fasttextannotator
π
Usage example In order to annotate a corpus of text, execute the following:
fasttextannotator [corpus-file] [output-file] [labels...]
You will then be presented with a line of text from the corpus, and a range of options based on the labels you provided.
π
License The project is available under the MIT license.
Acknowledgements
- Icon in README banner is text by Chameleon Design from the Noun Project.