kanjinetworks

Kanji Networks interface


License
MIT
Install
pip install kanjinetworks==0.1.6


Kanji Networks

kanjinetworks is an interface for the Kanji Networks database.


Description

Kanji Networks was a website offering etymologies for kanji. The website closed at the end of August 2016, but the database was made available as a PDF.

kanjinetworks is an interface for extracting, parsing and exporting the etymologies from the PDF file in Python.

The package also includes the kn_to_ja.py script, which imports the Kanji Networks etymologies into an iOS Japanese app backup file as notes. Warning: existing notes might be lost, as this script replaces them.

Requirements

kanjinetworks is compatible with Python 2 and depends on PDFMiner.

Install

To install kanjinetworks, run pip:

pip install kanjinetworks

or clone this repository and run setup:

python setup.py install

Usage

To import etymologies into a Japanese app backup file:

kn_to_ja.py PATH_TO_JAPANESE_BACKUP_FILE
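
Because the script replaces existing notes, it may be worth copying the backup file before running it. The following is a minimal sketch of such a precaution; `backup_copy` and the `.orig` suffix are hypothetical names chosen for illustration and are not part of kanjinetworks.

```python
import os
import shutil


def backup_copy(path, suffix=".orig"):
    """Copy `path` to `path + suffix` and return the path of the copy.

    Hypothetical helper, not part of kanjinetworks.
    """
    target = path + suffix
    # Refuse to overwrite an earlier safety copy.
    if os.path.exists(target):
        raise IOError("refusing to overwrite existing copy: " + target)
    # copy2 also preserves file metadata such as timestamps.
    shutil.copy2(path, target)
    return target
```

Calling `backup_copy` on the backup file first, then running kn_to_ja.py on the original, leaves an untouched copy to restore from if notes are lost.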

To use the parser:

from kanjinetworks import KanjiNetworksParser, get_text

# Extract the raw text of the Kanji Networks PDF.
text = get_text()

# Parse the text into kanji entries and print each one (Python 2 syntax).
kanjis = KanjiNetworksParser().parse(text)
for kanji in kanjis:
    print unicode(kanji)
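
One simple way to export the parsed entries is to write their text form to a UTF-8 file instead of printing them. The sketch below assumes only that each entry converts to text the way the loop above does; `export_entries` and its `to_text` parameter are hypothetical and not part of kanjinetworks. It uses `io.open` so that the same code runs on Python 2 and Python 3.

```python
import io


def export_entries(entries, path, to_text=str):
    """Write the text form of each entry, followed by a newline, to a UTF-8 file.

    Hypothetical helper, not part of kanjinetworks. On Python 2, pass
    to_text=unicode so that non-ASCII characters are preserved.
    Returns the number of entries written.
    """
    count = 0
    with io.open(path, "w", encoding="utf-8") as out:
        for entry in entries:
            out.write(to_text(entry))
            out.write(u"\n")
            count += 1
    return count
```

On Python 2, this could be called as `export_entries(kanjis, "etymologies.txt", to_text=unicode)`, where the output file name is only an example.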

Credits

The Etymological Dictionary of Han/Chinese Characters is by Lawrence J. Howell / Research Collaborator Hikaru Morimoto / Kanji Networks (http://www.kanjinetworks.com).

File distributed with the permission of the author.