carpedm

Character shapes image metadata manager for machine learning.


Keywords
machine, learning, framework, deep, image-to-text, japanese, character, image-processing, machine-learning, manager, metadata, ocr, tensorflow
License
MIT
Install
pip install carpedm==0.1.0


CarpeDM: Seize the Data Manager


Description

CarpeDM is a general library for downloading, viewing, and manipulating image data. Originally developed as a ChARacter shaPE Data Manager, CarpeDM aims to make Japanese character shape (字形) data and other image datasets more accessible to machine learning researchers.

Datasets Currently Available for Download

ID       Dataset
pmjtc    Dataset provided by the Center for Open Data in the Humanities (CODH)

A high-level interface, still in the early stages of development, is also provided for (1) automatic generation of model-ready data and (2) flexible training of models with a variety of deep learning frameworks.

Currently supported deep learning frameworks:

Installation

To install CarpeDM use pip:

$ pip install carpedm

See the installation guide for more details.
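
To confirm that the install succeeded, the installed version can be queried with Python's standard library (Python 3.8 or later). This is a minimal sketch and not part of CarpeDM itself: the helper name `installed_version` is hypothetical, and only the distribution name `carpedm` is taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # The distribution is not installed in the current environment.
        return None


if __name__ == "__main__":
    found = installed_version("carpedm")
    print(found if found is not None else "carpedm is not installed")
```

If the script reports that the package is not installed, check that pip and the Python interpreter belong to the same environment.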

Overview

A brief overview of the CarpeDM interface is given below.

For a complete guide to usage and available functionality, please refer to the documentation.

Interface

Using Your Data

If you would like to use CarpeDM with your own data, please follow the directory structure of the sample dataset. The current version of CarpeDM supports CSV files that follow the sample dataset's naming scheme and format.
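
Before comparing your own data against the sample dataset, it may help to list the CSV files in your dataset directory together with their header rows. The sketch below uses only the Python standard library and assumes nothing about CarpeDM's API. The function name `summarize_csv_files` and the directory name `my_dataset` are hypothetical, the files are assumed to be UTF-8 encoded, and the expected file names and columns must still be taken from the sample dataset.

```python
import csv
from pathlib import Path
from typing import Dict, List


def summarize_csv_files(root: str) -> Dict[str, List[str]]:
    """Map each CSV file under `root` (as a relative path) to its header row."""
    summary: Dict[str, List[str]] = {}
    root_path = Path(root)
    for csv_path in sorted(root_path.rglob("*.csv")):
        # Assumption: files are UTF-8 encoded; adjust the encoding if yours differ.
        with csv_path.open(newline="", encoding="utf-8") as handle:
            # An empty file yields an empty header list.
            header = next(csv.reader(handle), [])
        summary[csv_path.relative_to(root_path).as_posix()] = header
    return summary


if __name__ == "__main__":
    # "my_dataset" is a placeholder directory name.
    for name, header in summarize_csv_files("my_dataset").items():
        print(name, header)
```

The printed paths and headers can then be checked by eye against the sample dataset's layout.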

Contributing

Contributions are welcome! Please see the contribution guide.

Versioning

We use SemVer for versioning. For the versions available, see the tags on this repository.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments