smoothquant

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models


Keywords
smoothquant

Install
pip install smoothquant==0.0.1.dev0