citation-extractor

A tool to extract canonical references from text.


Keywords
canonical-references, classics, digital-classics, extraction, python
License
GPL-3.0
Install
pip install citation-extractor==1.6.3

(Canonical) Citation Extractor

Installation

This software supports Python 2.7 and has been tested only on POSIX-compliant operating systems (Linux, Mac OS X, FreeBSD, etc.).
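
If you want to check your environment before installing, something like the following may help. It is only a sketch: the helper name `is_tested_configuration` is made up for this example and is not part of the library.

```python
from __future__ import print_function

import os
import sys


def is_tested_configuration(version_info, os_name):
    """Return True for Python 2.7 on a POSIX system (hypothetical helper)."""
    # os.name is "posix" on Linux, Mac OS X and FreeBSD.
    return tuple(version_info[:2]) == (2, 7) and os_name == "posix"


if __name__ == "__main__":
    if is_tested_configuration(sys.version_info, os.name):
        print("Python 2.7 on POSIX: matches the tested configuration.")
    else:
        print("Warning: this is not the configuration the software was tested on.")
```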

To install CitationExtractor, first run:

$ pip install http://www.antlr3.org/download/Python/antlr_python_runtime-3.1.3.tar.gz#egg=antlr_python_runtime-3.1.3
$ pip install https://github.com/mromanello/treetagger-python/archive/master.zip#egg=treetagger-1.0.1

followed by:

$ pip install citation-extractor

NB: setup.py handles the installation of all other dependencies, but for some reason (that I'm still trying to figure out) it does not pick up these two.
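
Since the two packages above have to be installed by hand, a quick import check after installation can confirm that nothing was skipped. The snippet below is a rough sketch rather than part of the library: the function `missing_modules` is hypothetical, and the import names `antlr3`, `treetagger` and `citation_extractor` are assumptions about what the three packages expose, so adjust them if your installation differs.

```python
from __future__ import print_function

import importlib

# Assumed import names for the two manually installed dependencies
# and for the library itself; these are not confirmed by this README.
ASSUMED_MODULES = ["antlr3", "treetagger", "citation_extractor"]


def missing_modules(names):
    """Return the subset of module names that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_modules(ASSUMED_MODULES)
    if missing:
        print("Not importable:", ", ".join(missing))
    else:
        print("All modules imported successfully.")
```

If a module is reported as not importable, rerunning the corresponding pip command is the first thing to try.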

Documentation

I'm working on it ;-)

For the time being, you can find a concrete example of how to use the library in this notebook.