waterflow

Dataflow package provides a data analysis pipelineframework for data transformation and machine learning


Keywords
data, science, analysis, machine, learning
License
MIT
Install
pip install waterflow==v0.3

Documentation

Waterflow

Build Status

Waterflow package provides a data analysis pipeline framework for data transformation and machine learning

In one go

  • read data files (possibly massive)
  • add transformations and new features
  • train a model
  • test a model
  • score

Usage

Example display first line of somefile.txt after applying a function lamba and used another lambda for filtering.

from flow import Flow

flow = Flow()
print flow.read_file('somefile.txt').map(lambda).filter(lambda).batch(10)[0]