Cocoon is an open-source project that aims to free analysts from tedious data transformations by using LLMs.


License: MIT
Install: pip install cocoon-data==0.1.28

😎 Cocoon organizes your data warehouse using LLM agents, preparing it for analysis. Specifically, Cocoon helps you with data cleaning, data integration, and data modeling.

Get Started

👉 Try this Google Colab Notebook

Cocoon is available on PyPI:

pip install cocoon_data

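To get started, you need to connect to:

  • LLMs (e.g., GPT-4, Claude-3, Gemini-Ultra, or your local LLMs)
  • Data warehouses (e.g., Snowflake, DuckDB, ...)

from cocoon_data import *

# Imported explicitly so the names below do not rely on the star import
import openai
import snowflake.connector

# If you use OpenAI GPT-4 (replace the placeholder with your own key)
openai.api_key = 'xycabc'

# If you use Snowflake
con = snowflake.connector.connect(...)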

query_widget, cocoon_workflow = create_cocoon_workflow(con)

# a helper widget to query your data warehouse
query_widget.display()

# the main panel to interact with Cocoon
cocoon_workflow.start_workflow()
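
The snippet above hard-codes the API key for brevity. If you would rather keep the key out of your notebook, here is a minimal sketch that reads it from an environment variable. The helper name `load_api_key` and the variable name `OPENAI_API_KEY` are assumptions made for illustration; neither is part of Cocoon.

```python
import os


def load_api_key(var_name: str = "OPENAI_API_KEY") -> str:
    """Return the API key stored in the environment variable `var_name`.

    Hypothetical helper for illustration; it is not part of cocoon_data.
    """
    # Treat a missing variable and a blank one the same way.
    key = os.environ.get(var_name, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {var_name} environment variable before starting Cocoon."
        )
    return key
```

With this helper defined, the key assignment could become `openai.api_key = load_api_key()`, and a missing key fails early with a readable message.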

🎉 After `cocoon_workflow.start_workflow()` runs, you should see the following in a notebook: