kanjinetworks is an interface for the Kanji Networks database.
Kanji Networks was a website offering etymologies for kanjis. The website was closed end of August 2016, but the database was made available as a PDF.
kanjinetworks is an interface for extracting, parsing and exporting the etymologies from the PDF file in python.
The package also include the
kn_to_ja.py script to import the Kanji Networks etymologies into a iOS Japanese App backup files as notes. Warning: existing notes might be lost as this script replaces them.
kanjinetworks is compatible with Python versions 2 and depends on PDF miner.
To install kanjinetworks, run pip:
pip install kanjinetworks
or clone this directory and run setup:
python setup.py install
To import etymologies into a Japanese App backup files:
To use the parser:
from kanjinetworks import get_text from kanjinetworks import KanjiNetworksParser text = get_text() kanjis = KanjiNetworksParser().parse(text) for kanji in kanjis: print unicode(kanji)
The Etymological Dictionary of Han/Chinese Characters is by Lawrence J. Howell / Research Collaborator Hikaru Morimoto / Kanji Networks (http://www.kanjinetworks.com).
File distributed with the permission of the author.