.. |
aqlm
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
autoquant
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
awq
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
compressed_tensors
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
cutlass_w8a8
|
94f4e278ff
fix: illegal mem access for cutlass fp8 kernels
|
7 months ago |
exl2
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
fp8
|
37c6da9eb3
feat: vectorized fp8 quant kernel
|
7 months ago |
gguf
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
10 months ago |
gptq
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
gptq_marlin
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
int8_kvcache
|
9810daa699
feat: INT8 KV Cache (#298)
|
1 year ago |
marlin
|
d0afe0cd21
fix: suppress mma.sp warning on CUDA 12.5 and above
|
7 months ago |
quip
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
squeezellm
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
quant_ops.h
|
7e54c3916d
chore: factor out epilogues from cutlass kernels
|
7 months ago |