lens

Summarise and explore Pandas DataFrames


Keywords
dask, data-exploration, data-science, data-visualisation, dataframe, pandas
License
Apache-2.0
Install
pip install lens==0.4.5

Documentation

lens

lens is a library for exploring data in Pandas DataFrames. It computes single-column summary statistics and estimates the correlation between columns. We wrote lens when we realised that the first steps after acquiring a new data set are almost formulaic: What data type is in this column? How many null values are there? Which columns are correlated? What is the distribution of this value? lens calculates all of this for you.
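To make those four questions concrete, here is a minimal sketch of answering them by hand with plain pandas. It deliberately does not use lens's own API (see the documentation for that); the DataFrame and its column names (`age`, `income`, `city`) are made up for illustration.

```python
import pandas as pd

# Hypothetical toy data, invented for this example.
df = pd.DataFrame({
    "age": [23, 35, 41, None, 52],
    "income": [21000.0, 34000.0, 40000.0, 38000.0, 51000.0],
    "city": ["Leeds", "York", None, "York", "Leeds"],
})

# What data type is in each column?
dtypes = df.dtypes

# How many null values are there in each column?
null_counts = df.isnull().sum()

# Which numeric columns are correlated? (pairwise Pearson correlation;
# rows with a missing value in either column are skipped)
correlation = df[["age", "income"]].corr()

# What is the distribution of a value? (count, mean, std, quartiles)
age_distribution = df["age"].describe()

print(dtypes, null_counts, correlation, age_distribution, sep="\n\n")
```

Each of these steps has to be repeated for every column or pair of columns in a new data set, which is the repetitive work that lens is meant to take off your hands.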

See the documentation for more details.

Installation

lens can be installed from PyPI with pip:

pip install lens

Testing

To run the tests, use:

py.test --ignore=it

License

lens is licensed under the Apache License 2.0; see LICENSE.txt for details.