DashInfer is a native inference engine for pre-trained large language models (LLMs), developed by Tongyi Laboratory.


License
Apache-2.0
Install
pip install dashinfer==2.1.0
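
After running the install command, it can be useful to confirm that the pinned version is the one actually present in the environment. The following is a minimal sketch using only the Python standard library (`importlib.metadata`, Python 3.8+). It assumes the distribution name is `dashinfer`, as in the pip command above; the helper names `installed_version` and `check_install` are hypothetical and not part of DashInfer.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional

# Version pinned in the install command above.
EXPECTED_VERSION = "2.1.0"


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def check_install(package: str = "dashinfer",
                  expected: str = EXPECTED_VERSION) -> str:
    """Describe whether `package` is installed at the expected version."""
    found = installed_version(package)
    if found is None:
        return f"{package} is not installed"
    if found != expected:
        return f"{package} {found} is installed, expected {expected}"
    return f"{package} {expected} is installed"


if __name__ == "__main__":
    print(check_install())
```

This only reads the package metadata recorded by pip; it does not import DashInfer or exercise the engine itself.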