AlpinDale 0e6c400b13 feat: re-add GGUF (#600) 4 months ago
..
aqlm dc1b59df9c fix: compiler warnings for _C and _moe 4 months ago
autoquant 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 5 months ago
awq dc1b59df9c fix: compiler warnings for _C and _moe 4 months ago
compressed_tensors 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 5 months ago
cutlass_w8a8 9d98f29b3a chore: update cutlass to 3.5.1 4 months ago
exl2 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 5 months ago
fp8 dc1b59df9c fix: compiler warnings for _C and _moe 4 months ago
gguf 0e6c400b13 feat: re-add GGUF (#600) 4 months ago
gptq 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 5 months ago
gptq_marlin 141672a0d4 kernels: disambiguate quantized types via a new ScalarType 4 months ago
int8_kvcache 9810daa699 feat: INT8 KV Cache (#298) 10 months ago
marlin 141672a0d4 kernels: disambiguate quantized types via a new ScalarType 4 months ago
quip dc1b59df9c fix: compiler warnings for _C and _moe 4 months ago
squeezellm dc1b59df9c fix: compiler warnings for _C and _moe 4 months ago
quant_ops.h 0e6c400b13 feat: re-add GGUF (#600) 4 months ago