NB Workflows

Description

If SQL is the lingua franca for querying data, Jupyter should be the lingua franca for data exploration, model training, and complex, one-off data tasks.

This workflow platform lets you run parameterized notebooks programmatically. Notebooks can be scheduled with cron syntax, at fixed intervals, a set number of times, only once, or within a time range (for example, from 10am to 6pm).

The notebook is therefore both the main UI and the workflow job description.
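
To make the scheduling options above concrete, here is a minimal sketch of how a notebook job with parameters, an interval, a repeat count, and a daily time range could be modeled. It uses only the Python standard library. `NotebookJob`, its fields, and `planned_runs` are hypothetical names chosen for illustration; they are not the NB Workflows API or its job schema, and cron syntax is left out.

```python
from dataclasses import dataclass, field
from datetime import datetime, time, timedelta
from typing import Any, Dict, List, Optional


@dataclass
class NotebookJob:
    """Hypothetical job description; not the actual NB Workflows schema."""

    notebook: str                                   # path to the notebook file
    params: Dict[str, Any] = field(default_factory=dict)  # notebook parameters
    interval: Optional[timedelta] = None            # None means "run only once"
    repeat: Optional[int] = None                    # None means "no fixed count"
    window_start: Optional[time] = None             # inclusive start of daily range
    window_end: Optional[time] = None               # exclusive end of daily range

    def in_window(self, moment: datetime) -> bool:
        """Return True if `moment` falls inside the daily time range."""
        if self.window_start is None or self.window_end is None:
            return True
        return self.window_start <= moment.time() < self.window_end

    def planned_runs(self, start: datetime, limit: int = 5) -> List[datetime]:
        """List run times from `start`, honoring interval, repeat and window."""
        if self.interval is None:
            return [start] if self.in_window(start) else []
        wanted = self.repeat if self.repeat is not None else limit
        runs: List[datetime] = []
        candidate = start
        # Bounded loop so a window that never matches cannot spin forever.
        for _ in range(10_000):
            if len(runs) >= wanted:
                break
            if self.in_window(candidate):
                runs.append(candidate)
            candidate += self.interval
        return runs


job = NotebookJob(
    notebook="etl.ipynb",                 # illustrative file name
    params={"date": "2024-01-01"},        # illustrative parameter
    interval=timedelta(hours=3),
    repeat=3,
    window_start=time(10, 0),
    window_end=time(18, 0),
)
runs = job.planned_runs(datetime(2024, 1, 1, 9, 0))
for run in runs:
    print(run.isoformat())
```

With a start of 09:00, a three-hour interval, and a 10am to 6pm range, the sketch plans runs at 12:00 and 15:00 on the first day and 12:00 on the next day; the 18:00 slot is skipped because the end of the range is treated as exclusive here, which is a design choice of this sketch.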

Goal

Empower the different data roles in a project to put code into production, and reduce the time required to do so. It lets people go from a data exploration session to an entire pipeline deployed in production, using the same notebook file created by a data scientist, analyst, or anyone else working with data iteratively.

How to run

# launch web process
# swagger by default: http://localhost:8000/docs/swagger 
make web

# RQ Worker
make rqworker

# RQScheduler (optional)
make rqscheduler

Architecture

References & inspirations

Notebook Innovation - Netflix
Tensorflow metastore
Maintainable and collaborative pipelines

labfunctions
Release 0.9.0a12
