Diving equipment for data lake


Keywords
data_visualization, data_exploration, charts
License
MIT
Install
pip install dataswim==0.5.1

Documentation

Dataswim

Build Status Coverage Status Documentation

A simple api to explore, clean, transform and visualize data. This api is:

  • Minimalistic: short names, simple functionalites, minimal parameters
  • Pragmatic: focuses on raw efficiency rather than idiomatic code
  • Simple stupid: keep it easy to understand for both code and api

Features

  • Explore data: describe, search and visualize raw data
  • Clean and transform data: select, filter, normalize and reshape data
  • Visualize data: many kind of charts and maps for geo data

Install

Using conda:

conda install pandas sqlalchemy seaborn arrow nltk scikit-learn
conda install -c ioam holoviews bokeh
conda install altair --channel conda-forge
pip install --no-cache-dir dataset
pip install pytablewriter goerr gencharts chartjspy stuf \
   deepdish folium influxdb

pip install dataswim --no-deps

Using pip:

pip install --no-cache-dir dataset
pip install dataswim

To get the Altair charts in notebooks running install Vega:

conda install -c conda-forge vega
# or
pip install vega

Documentation

The documentation is available online

Some Jupyter demo notebooks are available as example.

To run the notebooks online click the badge: Binder

Dependencies

To compute data:

To chart data:

To handle databases:

Supported databases

  • Postgresql
  • Sqlite
  • All those that Sql Alchemy supports

Reporting

To generate static html reports use dsreports

To distribute reports with a http server use django-chartflo