Text Data
A Python package for exploring text data for analysis.
Pitch
When I analyzed text data for a story I wrote on how politicians craft their messaging on different platforms, I found myself spending a large share of my time exploring the data. This is standard in data journalism and in data analysis more broadly, but I found that much of the work I was doing could be generalized to other projects.
In particular, I found myself wanting to run quick statistics on various words; wanting to figure out which words made each candidate distinct; and wanting to view examples of each word.
This package is designed to make that exploration easy.
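To make the three tasks above concrete, here is a minimal sketch using only the Python standard library. It does not show this package's API: the function names (`word_counts`, `distinctive_words`, `examples`), the toy corpus, and the scoring rule (difference in relative frequency) are all assumptions chosen for illustration.

```python
from collections import Counter

# Hypothetical toy corpus: speaker name -> list of texts.
docs = {
    "candidate_a": ["jobs and taxes", "taxes on jobs"],
    "candidate_b": ["health care and jobs", "care for families"],
}


def word_counts(texts):
    """Count lowercased, whitespace-separated tokens across texts."""
    counts = Counter()
    for text in texts:
        counts.update(text.lower().split())
    return counts


def distinctive_words(docs, speaker, top_n=3):
    """Rank a speaker's words by how much more often they use them
    than all other speakers combined (difference in relative frequency).
    This scoring rule is an illustrative assumption."""
    own = word_counts(docs[speaker])
    others = word_counts(
        text
        for name, texts in docs.items()
        if name != speaker
        for text in texts
    )
    own_total = sum(own.values()) or 1
    others_total = sum(others.values()) or 1
    scores = {
        word: own[word] / own_total - others[word] / others_total
        for word in own
    }
    # Highest score first; ties broken alphabetically.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_n]


def examples(docs, word):
    """Return (speaker, text) pairs whose tokens include the word."""
    return [
        (name, text)
        for name, texts in docs.items()
        for text in texts
        if word in text.lower().split()
    ]
```

With this toy corpus, `word_counts(docs["candidate_a"])` gives the quick statistics, `distinctive_words(docs, "candidate_a")` ranks the words that set that speaker apart, and `examples(docs, "care")` pulls the texts where a word appears.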
Anti-Pitch
This package is not designed to handle preprocessing or text cleaning at all. Both tasks are a) too task-specific and b) too well handled by larger projects like nltk or spaCy for it to be useful to incorporate them in this work.
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.