😎 Cocoon organizes your data warehouse using LLM agents, preparing it for analysis. It connects to your data warehouses to explore and understand your data. It helps you interactively build, test, and maintain pipelines.
📚 We currently help you profile, clean and standardize your tables. Learn more about current features
👉 Check out the YouTube demo. Need support? Email: zh2408@columbia.edu
👉 Try this Google Colab Notebook
Cocoon is available on PyPI:
pip install cocoon_data
To get started, you need to connect to:
- LLMs (e.g., GPT-4, Claude-3, Gemini-Ultra...)
- Data Warehouses (e.g., Snowflake, DuckDB...)
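Both connections need credentials, and a notebook is an easy place to leak them. As a minimal sketch, assuming you keep secrets in environment variables, the helper below reads one and fails early if it is missing. The function name `require_env` and the variable name `OPENAI_API_KEY` are hypothetical choices for illustration and are not part of Cocoon.

```python
import os


def require_env(name: str) -> str:
    """Return the value of environment variable `name`, stripped of whitespace.

    Raises RuntimeError if the variable is unset or empty, so a missing
    credential is reported before any connection is attempted.
    """
    value = os.environ.get(name, "").strip()
    if not value:
        raise RuntimeError(f"Environment variable {name} is not set")
    return value


# Example usage (hypothetical variable name, set outside the notebook):
# api_key = require_env("OPENAI_API_KEY")
```

The returned string can then be used wherever a literal key would otherwise be pasted, for example when setting `openai.api_key`.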
import openai
import snowflake.connector
from cocoon_data import *

# if you use OpenAI, please ensure GPT-4 is available
# ('xycabc' is a placeholder; replace it with your own key)
openai.api_key = 'xycabc'

# if you use Snowflake
con = snowflake.connector.connect(...)

query_widget, cocoon_workflow = create_cocoon_workflow(con)

# a helper widget to query your data warehouse
query_widget.display()

# the main panel to interact with Cocoon
cocoon_workflow.start_workflow()
🎉 You should see the following in a notebook: