pdf2json

A python package that converts pdfdocs into json format


Keywords
algolia, pdf, json
License
MIT
Install
pip install pdf2json==0.3

Documentation

pdf2json

N|Solid

Build Status

pdf2json is a package that converts pdf documents into json(displayed per page) and optionally sends them to algolia as index records.

pdf2json uses a number of open source projects to work properly:

  • BeautifulSoup
  • PyPDF
  • AlgoliaSearch

And of course pdf2json itself is open source with a public repository on GitHub.

Installation

pdf2json requires python version 3+ to run.

Install the dependencies and devDependencies and using pip and start using.

Usage

from pdf2json import sendtoalgolia

client = sendtoalgolia(app_id,api_key)

client(algolia_indexname:string,generate_id:boolean,path_to_pdf:string)

License

MIT