chana

A library of NLP tools for the Shipibo-Konibo language


Keywords
nlp, shipibo, development, lemmatization, named-entity-recognition, pos-tagging, python, syllabification
License
MIT
Install
pip install chana==0.9

Documentation

Chana: An NLP toolkit for the Shipibo-Konibo language of Peru.

chana is a Python library of NLP tools for the Shipibo-Konibo language. Some of these tools can be reused for other Peruvian indigenous languages and for other highly agglutinative languages. It is built on top of scikit-learn and python-crfsuite, and is distributed under the MIT license.

Chana includes the following NLP tools:

  • Lemmatizer.
  • Part-of-Speech tagger.
  • Named-entity annotator.
  • Syllabifier.
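
Since chana is built on python-crfsuite, sequence tools such as a tagger typically describe each token with a dictionary of features before a CRF model sees it. The sketch below illustrates that general technique only; it is not chana's actual feature set or API. The function names (token_features, sentence_features), the feature names and the max_suffix parameter are all assumptions made for illustration. Suffix features are included because suffixes tend to carry a lot of information in agglutinative languages.

```python
def token_features(tokens, i, max_suffix=3):
    """Build a feature dict for tokens[i] (hypothetical feature set, not chana's)."""
    token = tokens[i]
    lower = token.lower()
    features = {
        "bias": 1.0,
        "lower": lower,
        "is_title": token.istitle(),
        "is_digit": token.isdigit(),
    }
    # Suffixes of increasing length, only when shorter than the token itself.
    for n in range(1, max_suffix + 1):
        if len(lower) > n:
            features["suffix{}".format(n)] = lower[-n:]
    # Left context, or a beginning-of-sentence marker.
    if i == 0:
        features["BOS"] = True
    else:
        features["prev_lower"] = tokens[i - 1].lower()
    # Right context, or an end-of-sentence marker.
    if i == len(tokens) - 1:
        features["EOS"] = True
    else:
        features["next_lower"] = tokens[i + 1].lower()
    return features


def sentence_features(tokens):
    """Return one feature dict per token of a tokenized sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]
```

A list of such dictionaries, one per token, is the kind of input that python-crfsuite accepts for a single sequence. The features chana really uses may differ.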

Installation

Dependencies

Chana requires:

  • Python (>= 3.4)
  • NumPy (>= 1.13.1)
  • Scikit-learn (>= 0.18.1)
  • Python-crfsuite (>= 0.9.5)
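
To check whether an environment already meets these minimums before installing, a small script along the following lines may help. It is a minimal sketch, not part of chana: the helper names (parse_version, check_requirements) are hypothetical, the version comparison is deliberately simplistic (numeric components only), and it relies on importlib.metadata, which needs Python 3.8 or newer even though chana itself only asks for Python 3.4.

```python
import re
from importlib import metadata  # standard library, Python 3.8+

# Minimum versions copied from the list above, keyed by PyPI distribution name.
REQUIREMENTS = {
    "numpy": "1.13.1",
    "scikit-learn": "0.18.1",
    "python-crfsuite": "0.9.5",
}


def parse_version(text):
    """Turn '1.13.1' into (1, 13, 1); stop at the first non-numeric component."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def check_requirements(requirements=REQUIREMENTS):
    """Return {name: 'ok' | 'too old (found X)' | 'missing'} for each requirement."""
    report = {}
    for name, minimum in requirements.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        if parse_version(installed) >= parse_version(minimum):
            report[name] = "ok"
        else:
            report[name] = "too old (found {})".format(installed)
    return report


if __name__ == "__main__":
    for name, status in check_requirements().items():
        print("{}: {}".format(name, status))
```

Any package reported as missing or too old can then be installed or upgraded with pip before installing chana.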

User installation

If you already have a working installation of NumPy, scikit-learn and python-crfsuite, the easiest way to install chana is with pip:

pip install chana

Help and Support

Important links

Communication

Contact

For questions and feedback, please contact:

Acknowledgments

  • Pontificia Universidad Católica del Perú (PUCP)
  • Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica (CONCYTEC)
  • NVIDIA
  • Amazon Web Services