dataRX

Extension used to import, read, and give preliminary descriptive statistics of data files.


License

0BSD

Install

pip install dataRX==2.0

Documentation

dataRX

A simple extension used to give a descriptive statistical analysis of your data set.

dataRX is a package that grew out of a need that arose while competing in Kaggle competitions. It can get quite tedious and time-consuming to search through a dataset for its general info each time you want to enter a competition. You would rather spend your time on feature engineering, creating models, and developing pipeline processes. With this in mind, dataRX was created to import the dataset, get preliminary info about it, and generate an initial report.
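
To make the preliminary checks described above concrete, here is a rough sketch of what gathering that general info by hand might look like with plain pandas. It is not dataRX's implementation or API: the preliminary_report helper, the keys of the dictionary it returns, and the sample data are all hypothetical and exist only for illustration.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for a competition CSV file.
SAMPLE_CSV = """id,age,fare,embarked
1,22,7.25,S
2,38,71.28,C
3,,7.92,S
4,35,53.10,
"""


def preliminary_report(source):
    """Load a CSV and collect basic facts about it.

    Illustrative helper only; it is NOT part of dataRX.
    """
    df = pd.read_csv(source)
    return {
        "rows": df.shape[0],                         # number of records
        "columns": df.shape[1],                      # number of fields
        "dtypes": df.dtypes.astype(str).to_dict(),   # inferred type per column
        "missing": df.isnull().sum().to_dict(),      # missing values per column
        "numeric_summary": df.describe(),            # count, mean, std, quartiles
    }


report = preliminary_report(io.StringIO(SAMPLE_CSV))
print(report["rows"], "rows,", report["columns"], "columns")
print(report["missing"])
print(report["numeric_summary"])
```

Repeating steps like these for every new dataset is the kind of routine work the package is meant to take off your hands.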

What you'll need

numpy >= 1.11.0
pandas >= 0.17.1
scipy >= 0.17.0

Questions & inquiries

Email: rashadaalston@gmail.com
LinkedIn: http://linkedin.com/in/rashadalston
Website: http://rashadalston.com
Github: http://github.com/ralston3

Documentation

Install

Use: pip install dataRX

Import

Import the class with from dataRX.processor import RX and generate your report as follows. file is the data file that you want to process, and save_as is the location where you intend to save your report.

Your output from rx.report() should look something like this.

You can also use the RX object to do things such as view the file's general info, as follows:

Which should output something similar to:

It's a pretty simple package, but quite useful if you're like me and prefer to get right into the data without doing all of the preliminary work. Look for upgrades soon!

version 0.4.2