data-commons data-commons

Collection of Open Source libraries that enable working with data at scale


Repositories

data-commons/prep-buddy
A Scala / Java / Python library for cleansing, transforming and preparing large datasets for ML o...
Scala - Apache-2.0 - Last pushed - 8 stars - 4 forks
data-commons/protectr
A Scala / Java / Python library for anonymization, encryption and redaction operations for large ...
Scala - Apache-2.0 - Last pushed - 2 stars
data-commons/pyts
A library for stats module in python
Updated - 0 stars
data-commons/data-commons.github.io
HTML - Last pushed - 0 stars - 1 forks
data-commons/spark-setup
This is a simple setup for spark using maven.
Scala - Last pushed - 0 stars
data-commons/home
This repository is no longer available - 0 stars
See all data-commons's repositories

Published Projects

prep-buddy
A library for cleaning, transforming and executing all other preparation tasks for large datasets...
Latest release 0.5.1 - Updated - 8 stars
See all data-commons's projects

Top Contributors See all

Lalit Sagar Maurya Abhishek Gupta Rahul Nandi Srihari