kraken-pyds

Kraken: Python distributed data transfer tool.


Keywords
thrift, python, kraken, hdfs, aws-s3, data-backup, data-life-cycle, data-migration, data-transfer, disaster-recovery, gcs
License
MIT
Install
pip install kraken-pyds==2.0.1


Kraken

Kraken is a distributed, fast, and reliable data transfer service for efficiently moving large amounts of data between different storage technologies. It uses a distributed master-worker architecture to achieve high transfer speeds and horizontal scaling, and implements multiple failover, recovery, and data consistency check mechanisms to ensure robustness, fault tolerance, and reliability.

Typical use cases for Kraken include, but are not limited to:

  • Data migration (across storage technologies).
  • Data replication.
  • Backup and disaster recovery.
  • Data life cycle management.

Features

  • Distributed data transfer service.
  • Supports multiple storage technologies.
  • Easy to set up, with configurable services.
  • Command line interface to submit and monitor jobs.
  • Supports multiple scheduling algorithms and job priorities.
  • Supports multiple job placement policies across workers.
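
To make the scheduling and placement features more concrete, here is a minimal, self-contained sketch of a priority-based job queue combined with a least-loaded placement policy. It is not Kraken's implementation or API: the `Scheduler` class, its `submit` and `dispatch` methods, the job and worker names, and the choice of policies are all hypothetical and used only for illustration.

```python
import heapq
import itertools


class Scheduler:
    """Hypothetical illustration only; not Kraken's scheduler."""

    def __init__(self, workers):
        self._queue = []                      # heap of (-priority, seq, job_name)
        self._seq = itertools.count()         # tie-breaker: FIFO order within a priority
        self._load = {w: 0 for w in workers}  # number of jobs placed on each worker

    def submit(self, job_name, priority=0):
        # Negate the priority so the highest priority is popped first.
        heapq.heappush(self._queue, (-priority, next(self._seq), job_name))

    def dispatch(self):
        """Pop the highest-priority job and place it on the least-loaded worker."""
        if not self._queue:
            return None
        _, _, job_name = heapq.heappop(self._queue)
        # On a tie, min() returns the first worker in insertion order.
        worker = min(self._load, key=self._load.get)
        self._load[worker] += 1
        return job_name, worker


scheduler = Scheduler(["worker-1", "worker-2"])
scheduler.submit("nightly-backup", priority=1)
scheduler.submit("urgent-migration", priority=5)
scheduler.submit("archive-logs", priority=1)

assignments = [scheduler.dispatch() for _ in range(3)]
print(assignments)
```

In this sketch the higher-priority job is dispatched first, and jobs are spread across workers by current load. A real service would also need to track job completion, worker failures, and retries, which are left out here.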

Install

pip install kraken-pyds
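
After installing, one way to confirm which version is present is to query the package metadata from Python. The sketch below uses only the standard library (`importlib.metadata`, Python 3.8+); the helper name `installed_version` is an assumption introduced for this example and is not part of Kraken.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(distribution):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


# Prints the installed version, or None if kraken-pyds is not installed.
print(installed_version("kraken-pyds"))
```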