Quantized MatMul in CUDA with a PyTorch interface
pip install quant-matmul==0.0.0
Original code from FasterTransformer / TensorRT-LLM: https://github.com/NVIDIA/TensorRT-LLM/tree/main/cpp/tensorrt_llm/kernels
Adapted to support a different quantization scheme.
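The kernels perform weight-only quantized matrix multiplication. As a rough illustration of the underlying idea only (this is NOT this package's API, which wraps CUDA kernels behind a PyTorch interface), here is a minimal NumPy sketch of symmetric per-output-channel int8 weight quantization followed by a dequantizing matmul:

```python
import numpy as np

def quantize_weight(w, bits=8):
    # Symmetric per-output-channel quantization: one scale per column of w.
    qmax = 2 ** (bits - 1) - 1                  # 127 for int8
    scale = np.abs(w).max(axis=0) / qmax        # shape: (out_features,)
    q = np.clip(np.round(w / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def quant_matmul_ref(x, q, scale):
    # Accumulate the product in float32, then rescale each output channel.
    acc = x.astype(np.float32) @ q.astype(np.float32)
    return acc * scale

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 16)).astype(np.float32)
w = rng.standard_normal((16, 8)).astype(np.float32)
q, s = quantize_weight(w)
out = quant_matmul_ref(x, q, s)
ref = x @ w
# Per-channel int8 weights keep the result close to full precision.
print(np.max(np.abs(out - ref)) / np.max(np.abs(ref)) < 0.05)
```

A real CUDA kernel fuses the integer GEMM and the rescaling into one pass over packed weights; the sketch above only shows the numerics, not the layout or fusion.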