pygrobid

A python client for Grobid service


License
MIT-feh
Install
pip install pygrobid==0.1.6

Documentation

Simple python client for GROBID REST services

This Python client is used to process a single PDF document by the GROBID service. Results are returned in an XML format.

Build and run

You need first to install and start the grobid service, latest stable version, see the documentation. The default server host is localhost and port is 8080. The GrobidClient can be configure via host and port.

Requirements

This client has been developed and tested with Python 3.7.

Install

pip install pygrobid

Usage and options

You can take a quick test via python tests.py pdf_file -h host -p port

In your code:

from grobid.client import GrobidClient

client = GrobidClient(host, port)
rsp = client.serve(service_name, pdf_file)
rsp = client.serve(service_name, pdf_file, consolidate_header=1)

upgrade history

  • 0.1.6: support coordinates

Acknoledgement

This project is based on grobid-python-client by Patrice Lopez (patrice.lopez@science-miner.com)