AlpinDale ba371fbbbd feat: AWQ marlin kernels (#603) | vor 5 Monaten | |
---|---|---|
.. | ||
schemes | ba371fbbbd feat: AWQ marlin kernels (#603) | vor 5 Monaten |
__init__.py | f4ea11b982 feat: initial support for activation quantization | vor 6 Monaten |
compressed_tensors.py | 9be43994fe feat: fbgemm quantization support (#601) | vor 5 Monaten |
utils.py | 00503b9fc1 feat: non-uniform quantization via `compressed-tensors` for llama | vor 5 Monaten |