data-manipulation

Powerful data manipulation


Keywords
pandas, pyspark, python, utility-library
License
BSD-1-Clause
Install
pip install data-manipulation==0.37

Documentation

Data Manipulation

My data manipulation library includes functions built on top of popular Python libraries such as Pandas, PySpark and more.

What is it?

Data Manipulation is a Python package providing powerful utility functions. It contains many subpackages with utility functions built for popular packages such as Pandas, PySpark and many more.

Where to get it

The source code is currently hosted on GitHub at: https://github.com/shawnngtq/data-manipulation

To install the latest release with pip:

pip install data-manipulation

Dependencies

The following dependencies are installed automatically along with this package:

  • beautifulsoup4
  • django
  • numpy
  • pandas
  • ipython
  • requests
  • psycopg2
  • pyspark
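
To confirm that the dependencies listed above ended up in the current environment, a minimal sketch using only the standard library's importlib.metadata (Python 3.8+) might look like the following. The helper name installed_versions is hypothetical and is not part of data-manipulation; the distribution names are copied from the list above.

```python
from importlib import metadata

# Distribution names copied from the dependency list above.
DEPENDENCIES = [
    "beautifulsoup4",
    "django",
    "numpy",
    "pandas",
    "ipython",
    "requests",
    "psycopg2",
    "pyspark",
]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent.

    Hypothetical helper for illustration; not part of data-manipulation.
    """
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions(DEPENDENCIES).items():
        print(f"{name}: {version or 'not installed'}")
```

The lookup is by distribution name, not import name, so an environment that satisfies psycopg2 through the separately named psycopg2-binary distribution would be reported as "not installed" for psycopg2.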

License

BSD 3-Clause

Getting Help

For usage questions, the best place to go is Stack Overflow.

Discussion and Development

Most development discussion takes place on GitHub in this repo.

Contributing to data manipulation

All contributions, bug reports, bug fixes, documentation improvements, enhancements and ideas are welcome.

Contributors and maintainers of this project are expected to abide by our code of conduct. More information can be found at: Contributor Code of Conduct