txt-utils
CLI to modify text files.
Features
-
merge
: merge multiple text files into one -
extract-vocabulary
: extract unit vocabulary -
transcribe
: transcribe units -
replace
: replace text -
replace-line
: replace text in a line -
trim-units
: trim units -
remove-units
: remove units -
create-unit-occurrence-stats
: create unit occurrence statistics
Roadmap
- create n-grams
- map units
- merge units right/left
- calculate units TF-IDF
Installation
pip install txt-utils --user
Usage
txt-utils-cli
Citation
If you want to cite this repo, you can use the BibTeX-entry generated by GitHub (see About => Cite this repository).
Taubert, S. (2024). txt-utils (Version 0.0.3) [Computer software]. https://doi.org/10.5281/zenodo.10571273
Acknowledgments
Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 416228727 – CRC 1410