inspectdf
Overview
inspectdf is a collection of utilities for columnwise summary,
comparison and visualisation of data frames. Functions are provided to
summarise missingness, categorical levels, numeric distribution,
correlation, column types and memory usage.
The package has three aims:
- to speed up repetitive checking and exploratory tasks for data frames
- to make it easier to compare data frames for differences and inconsistencies
- to support quick visualisation of data frames
Check out the package website for further documentation and examples.
Installation
To install the development version of the package, use
devtools::install_github("alastairrushworth/inspectdf")
To install the CRAN version of the package, use
install.packages("inspectdf")
Key functions
- inspect_types(): summary of column types
- inspect_mem(): summary of memory usage of columns
- inspect_na(): columnwise prevalence of missing values
- inspect_cor(): correlation coefficients of numeric columns
- inspect_imb(): feature imbalance of categorical columns
- inspect_num(): summaries of numeric columns
- inspect_cat(): summaries of categorical columns
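As a rough illustration of how these functions are typically called, the sketch below runs inspect_na() on the airquality data set that ships with base R. The data set is chosen here only as an example and is not part of this README. The sketch assumes the package is installed, that inspect_na() accepts an optional second data frame for comparison, and that show_plot() renders the returned summary, as described in the package documentation.

```r
library(inspectdf)

# airquality is a built-in base R data set; it has missing values
# in the Ozone and Solar.R columns, which makes it a handy example.
na_summary <- inspect_na(airquality)

# The result is a data frame with one row per column of the input.
print(na_summary)

# Passing a second data frame compares missingness between the two.
# The split into two halves is arbitrary and purely illustrative.
first_half  <- airquality[1:75, ]
second_half <- airquality[76:153, ]
na_comparison <- inspect_na(first_half, second_half)
print(na_comparison)

# show_plot() draws a plot of a summary returned by an inspect_*() function.
show_plot(na_summary)
```

The other inspect_*() functions listed above are called in the same way, with a data frame as the first argument.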
Comments? Suggestions? Issues?
Any feedback is welcome! Feel free to open a GitHub issue or send me a message on Twitter.