..
aqlm | dc1b59df9c fix: compiler warnings for _C and _moe | 4 months ago
autoquant | 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) | 5 months ago
awq | dc1b59df9c fix: compiler warnings for _C and _moe | 4 months ago
compressed_tensors | 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) | 5 months ago
cutlass_w8a8 | 9d98f29b3a chore: update cutlass to 3.5.1 | 4 months ago
exl2 | 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) | 5 months ago
fp8 | dc1b59df9c fix: compiler warnings for _C and _moe | 4 months ago
gguf | 0e6c400b13 feat: re-add GGUF (#600) | 4 months ago
gptq | 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) | 5 months ago
gptq_marlin | 141672a0d4 kernels: disambiguate quantized types via a new ScalarType | 4 months ago
int8_kvcache | 9810daa699 feat: INT8 KV Cache (#298) | 10 months ago
marlin | 141672a0d4 kernels: disambiguate quantized types via a new ScalarType | 4 months ago
quip | dc1b59df9c fix: compiler warnings for _C and _moe | 4 months ago
squeezellm | dc1b59df9c fix: compiler warnings for _C and _moe | 4 months ago
quant_ops.h | 0e6c400b13 feat: re-add GGUF (#600) | 4 months ago