AlpinDale d0afe0cd21 fix: suppress mma.sp warning on CUDA 12.5 and above 7 months ago
..
aqlm 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
autoquant 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
awq 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
compressed_tensors 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
cutlass_w8a8 94f4e278ff fix: illegal mem access for cutlass fp8 kernels 7 months ago
exl2 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
fp8 37c6da9eb3 feat: vectorized fp8 quant kernel 7 months ago
gguf 9d81716bfd [v0.5.3] Release Candidate (#388) 10 months ago
gptq 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
gptq_marlin 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
int8_kvcache 9810daa699 feat: INT8 KV Cache (#298) 1 year ago
marlin d0afe0cd21 fix: suppress mma.sp warning on CUDA 12.5 and above 7 months ago
quip 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
squeezellm 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 months ago
quant_ops.h 7e54c3916d chore: factor out epilogues from cutlass kernels 7 months ago