Cocoon is an open-source project that aims to free analysts from tedious data transformations with LLM.


License
MIT
Install
pip install cocoon-data==0.1.72

Documentation

Cocoon Logo

License: MIT

😎 Cocoon organizes your data warehouse using LLM agents, preparing it for analysis. It connects to your data warehouses to explore and understand your data. It helps you interactively build, test, and maintain pipelines.

Get Started

👉 Try this Google Collab Notebook

Cocoon is available on PyPI:

pip install cocoon_data

To get started, you need to connect to

  • LLMs (e.g., GPT-4, Claude-3, Gemini-Ultra...)
  • Data Warehouses (e.g., Snowflake, Duckdb...)
from cocoon_data import *

# if you use Open AI, please ensure GPT-4 is available
openai.api_key  = 'xycabc'

# if you use Snowflake
con = snowflake.connector.connect(...)

query_widget, cocoon_workflow = create_cocoon_workflow(con)

# a helper widget to query your data warehouse
query_widget.display()

# the main panel to interact with Cocoon
cocoon_workflow.start_workflow()

🎉 You shall see the following on a notebook: