Commonly used functions/utilities for DDF file model.


Keywords
etl, ddf, datasets, recipe, dataset
License
MIT
Install
pip install ddf-utils==1.0.12

Documentation

ddf_utils

status version pyversion codecov

ddf_utils is a Python library and command line tool for people working with Tabular Data Package in DDF model. It provides various functions for ETL tasks, including string formatting, data transforming, generating datapackage.json, reading data form DDF datasets, running recipes, a decleative DSL designed to manipulate datasets to generate new datasets, and other functions we find useful in daily works in Gapminder.

Installation

Python 3.6+ is required in order to run this module.

To install this package from pypi, run:

$ pip install ddf_utils

To install from the latest source, run:

$ pip install git+https://github.com/semio/ddf_utils.git

For Windows users

If you encounter failed to create process when you run the ddf command, please try updating setuptools to latest version:

$ pip3 install -U setuptools

and then reinstall ddf_utils should fix the problem.

Usage

Check the documents for how to use ddf_utils.