Python library for managing and annotating text corpora in different formats (ELAN, TIG, TTL, et cetera)


Keywords
nlp, annotation, text, corpus, linguistics, ELAN, transcription, pypi, annotations, nlp-datasets
License
MIT
Install
pip install texttaglib==0.1.1

Documentation

texttaglib

A Python library for managing and annotating text corpora in different formats (ELAN, TIG, TTL, et cetera).


Warning

⚠️ The texttaglib package is now in maintenance mode and exists only to support legacy software. All new development will be moved to the speach library.

Migration from texttaglib to speach should be trivial:

# just change import statements from something like
from texttaglib import elan
# to 
from speach import elan
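
For code that has to run in environments where either package may be installed, the import change above can be wrapped in a fallback. The following is only a minimal sketch: the helper name `first_importable` is hypothetical and is not part of texttaglib or speach, and it assumes both packages expose an `elan` submodule as shown in the import statements above.

```python
import importlib


def first_importable(candidates):
    """Return the first module in `candidates` that can be imported, else None."""
    for name in candidates:
        try:
            return importlib.import_module(name)
        except ImportError:
            # Covers ModuleNotFoundError too; try the next candidate.
            continue
    return None


# Prefer the maintained package, fall back to the legacy one.
elan = first_importable(["speach.elan", "texttaglib.elan"])
if elan is None:
    print("Neither speach nor texttaglib is installed")
```

Listing `speach.elan` first means the maintained library is used whenever it is available, so the legacy package only serves as a safety net.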

Installation

# change
pip install texttaglib 
# into
pip install speach

For more information, please visit: https://github.com/neocl/speach

Legacy

texttaglib legacy releases (>= 0.1.1, < 0.2) embed chirptext-0.1 and puchikarui-0.1 to support legacy APIs.

Sample imports for the legacy API:

from texttaglib import ttl
from texttaglib.chirptext import chio
from texttaglib.chirptext import deko
from texttaglib.puchikarui import Schema
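
Because these embedded modules are only described for the legacy range (>= 0.1.1, < 0.2), it may help to check the installed version before relying on them. The sketch below uses the standard library's `importlib.metadata`. The helper `in_legacy_range` is hypothetical, and it assumes plain numeric version strings such as `0.1.1`; pre-release or post-release suffixes are not handled.

```python
from importlib import metadata


def in_legacy_range(version):
    """True if `version` is >= 0.1.1 and < 0.2 (plain numeric versions only)."""
    parts = tuple(int(p) for p in version.split("."))
    return (0, 1, 1) <= parts < (0, 2)


try:
    installed = metadata.version("texttaglib")
except metadata.PackageNotFoundError:
    installed = None

if installed is None:
    print("texttaglib is not installed")
else:
    print(installed, "is a legacy release:", in_legacy_range(installed))
```

The comparison relies on Python's tuple ordering, which is enough for simple dotted versions. A full implementation would more likely use a dedicated version parser.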

Legacy documentation: https://texttaglib.readthedocs.io