AlpinDale b03b4d4c8c fix: compute cutlass 3.x epilogues in fp32 instead of 16 7 months ago
aqlm 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
autoquant 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
awq 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
compressed_tensors 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
cutlass_w8a8 b03b4d4c8c fix: compute cutlass 3.x epilogues in fp32 instead of 16 7 months ago
exl2 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
fp8 37c6da9eb3 feat: vectorized fp8 quant kernel 7 months ago
gguf 9d81716bfd [v0.5.3] Release Candidate (#388) 10 months ago
gptq 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
gptq_marlin 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
int8_kvcache 9810daa699 feat: INT8 KV Cache (#298) 1 year ago
marlin 1587fab5de fix: cuda version check for mma warning suppression 7 months ago
quip 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
squeezellm 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
quant_ops.h 5b464d36ea feat: bias epilogue support for cutlass kernels 7 months ago