pdf-to-markdown

Convert PDF files into markdown files


License
BSD-3-Clause
Install
pip install pdf-to-markdown==0.1.0

Documentation

PDF To Markdown

This is NOT a general-purpose converter. Currently only for urban planning document in Taiwan.

Demo

From this PDF file, we generate:

System Requirement

You should install pdfminer first.

If your PDF file doesn't contain Chinese Characters

sudo apt-get install python-pdfminer

Else

git clone git@github.com:euske/pdfminer.git
cd pdfminer
make cmap
sudo python setup.py install

The make cmap is necessary for documents containing Chinese characters.

Usage

Just type

python main.pdf <pdf>

For example, you can use our example PDF file:

python main.pdf examples/neihu.pdf