This solution build by design (configuration) simple of complex Data Pipelines


License
Other
Install
pip install pipelite==0.1.1

Documentation

The pipelite Project

Empower your data workflows effortlessly with pipelite, a lightweight Python program designed for seamless data pipeline creation and execution. Using a simple JSON configuration, users can build complex pipelines without writing code. What sets pipelite apart is its total extensibility—anyone can easily create and integrate new connectors or transformations, enhancing the program's capabilities.

It's also possible to add new way to manage the flow of the transformations if needed. With a MIT license fostering collaboration, this flexible tool is perfect for users of all levels. Craft, execute, and extend your data pipelines with pipelite, your go-to solution for adaptable and scalable data processing.

Some characteristics:

  • Simple JSON configuration
  • Lightweight and code-free (MIT license for flexibility)
  • Python Code (leverage the basics libraries instead addind many heavy and complex libs)
  • Effortless pipeline creation and high integrability thanks to the json configuration
  • Streamlined execution process
  • Total extensibility (connectivity, transformation, pipeline management)
  • Boost data processing efficiency
  • Quick learning curve
  • Empower your data workflows in a simple way

So in one word ... pipelite is your extensible solution for dynamic data pipelines.

Currently this solution provides data access and load from these data sources :

External file (csv)
External Excel Spreadsheet (xls, xlsx, xlsm, xlsb, odf, ods and odt) (read only)
External XES File (read only)
ODBC Data Sources (checked with SQL Server, SQLite) by using an configurable SQL query (Read Only)
SAP Read Table via SAP RFC (Read Only)
ABBYY Timeline PI (write only in Repository)

And provides those transformers

Pass Through (Ex. just to change the Data Sources names IN-OUT)
Dataset Profiling
Concat 2 Data sources
Join data sources
Lookup
SubString
Rename Column Name
Column Transformation

👉 Jump to the wiki from here

Installation

just use pip by typing

    pip install pipelite

... and just use it !

See here

External modules