Core API for managing (scientific) data


License
MIT
Install
pip install dtoolcore==3.18.2

Documentation

Manage scientific data sets

PyPi package Travis CI build status (Linux) AppVeyor CI build status (Windows) Code Coverage Documentation Status

Features

  • Core API for adding different types of metadata to files on disk
  • Automatic generation of structural metadata
  • Programmatic discovery and access of items in a dataset
  • Structural metadata includes hash, size and modification time for subsequent integrity checks
  • Ability to annotate individual files with arbitrary metadata
  • Metadata stored on disk as plain text files, i.e. disk datasets generated using this API can be accessed without special tools
  • Ability to create plugins for custom storage solutions
  • Plugins for iRODS and Microsoft Azure storage backends available
  • Cross-platform: Linux, Mac and Windows are all supported
  • Works with Python 2.7, 3.5 and 3.6
  • No external dependencies

Overview

The dtoolcore project provides a Python API for managing (scientific) data. It allows researchers to:

  • Package data and metadata into a dataset
  • Organise and backup datasets easily
  • Find datasets of interest
  • Verify the contents of datasets
  • Discover and work with data programatically