Open-source Python toolkit for Arabic Natural Understanding, allowing people to integrate it in their system workflow.


Keywords
sinatools
License
MIT
Install
pip install SinaTools==1.0.1

Documentation

SinaTools

Open Source Toolkit for Arabic NLP and NLU developed by SinaLab at Birzeit University. SinaTools is available through Python APIs, command lines, colabs, and online demos.

See the full list of Available Packages, which include: (1) Morphology Tagging, (2) Named Entity Recognition (NER), (3) Word Sense Disambiguation (WSD), (4) Semantic Relatedness, (5) Synonymy Extraction and Evaluation, (6) Relation Extraction, (7) Utilities (diacritic-based word matching, Jaccard similarly, parser, tokenizers, corpora processing, transliteration, etc).

See Demo Pages.

See the benchmarking, which shows that SinaTools outperformed all related toolkits.

Installation

To install SinaTools, ensure you are using Python version 3.11.11, then clone the GitHub repository.

Alternatively, you can execute the following command:

pip install sinatools

Installing Models and Data Files

Some modules in SinaTools require some data files and fine-tuned models to be downloaded. To download these models, please consult the DataDownload.

Documentation

For information, please refer to the main page or the online domuementation.

Citation

Tymaa Hammouda, Mustafa Jarrar, Mohammed Khalilia: SinaTools: Open Source Toolkit for Arabic Natural Language Understanding. In Proceedings of the 2024 AI in Computational Linguistics (ACLing 2024), Procedia Computer Science, Dubai. ELSEVIER.

License

SinaTools is available under the MIT License. See the LICENSE file for more information.

Reporting Issues

To report any issues or bugs, please contact us at "sina.institute.bzu@gmail.com" or visit SinaTools Issues.