etched-blitz

A high-throughput and memory-efficient inference and serving engine for LLMs


License
Apache-2.0
Install
pip install etched-blitz==0.0.1.dev1