pystarburst

PyStarburst DataFrame API allows you to query and transform data in Starburst products in a data pipeline without having to download the data locally.


License
Apache-2.0
Install
pip install pystarburst==0.8.0

Documentation

PyStarburst DataFrame API examples

This repository contains samples for using PyStarburst. In order to use them, just sign up for a free Galaxy account (if you don't have one already) and try out the notebooks.

The detailed API documentation is located at: https://pystarburst.eng.starburstdata.net/

Notebooks

The easiest way to use the notebooks is to start a cloud notebook environment by clicking Binder.

Apps

You can also run the apps locally. To do so, you need to install the dependencies first. Testing has been done with Python 3.10, 3.11 and on MacOS and Linux.

You also need to supply the connection info for the Starburst cluster and OpenAPI APIs in the env.py file.

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Then, you can run the apps:

python app.py