An extensive library for processing FoLiA documents. FoLiA stands for Format for Linguistic Annotation and is a very rich XML-based format used by various Natural Language Processing tools.


Keywords
nlp, computational, linguistics, folia, format, xml, annotation, clariah, clarin, computational-linguistics, pynlpl
License
GPL-3.0-only
Install
pip install FoLiA==2.5.11

Documentation

FoLiA Library for Python

image

Documentation Status

image

Project Status: Active – The project has reached a stable, usable state and is being actively developed.

Latest release in the Python Package Index

image

This Python module provides an extensive library for parsing, creating and otherwise processing documents in the Format for Linguistic Annotation, aka FoLiA. It has been in active development since 2010 and used by numerous Natural Language Processing (NLP) tools.

This library used to be part of PyNLPL (pynlpl.formats.folia), but has been migrated to this standalone library in spring 2019.

Acknowledgement

FoLiA development is funded in the scope of the larger CLARIN-NL project and its successor CLARIAH.

The following modules are available:

  • folia.main - The main library
  • folia.setdefinition - A module for FoLiA Set Definitions
  • folia.fql - Support for the FoLiA Query Language (FQL)

Installation

Download and install the latest stable version directly from the Python Package Index with pip install folia (or pip3 for Python 3.7+ on most systems). For global installations prepend sudo.

Alternatively, clone this repository and run python setup.py install (or python3 setup.py install for Python 3 on most system. Prepend sudo for global installations.

The Python FoLiA library is also included in our LaMachine distribution.

Documentation

API Documentation and tutorials can be found here .