..
aqlm | dc1b59df9c | fix: compiler warnings for _C and _moe | 4 months ago
autoquant | 156f577f79 | feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) | 5 months ago
awq | dc1b59df9c | fix: compiler warnings for _C and _moe | 4 months ago
compressed_tensors | 156f577f79 | feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) | 5 months ago
cutlass_w8a8 | 9d98f29b3a | chore: update cutlass to 3.5.1 | 4 months ago
exl2 | 156f577f79 | feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) | 5 months ago
fp8 | dc1b59df9c | fix: compiler warnings for _C and _moe | 4 months ago
gguf | 0e6c400b13 | feat: re-add GGUF (#600) | 4 months ago
gptq | 156f577f79 | feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) | 5 months ago
gptq_marlin | 141672a0d4 | kernels: disambiguate quantized types via a new ScalarType | 4 months ago
int8_kvcache | 9810daa699 | feat: INT8 KV Cache (#298) | 10 months ago
marlin | 141672a0d4 | kernels: disambiguate quantized types via a new ScalarType | 4 months ago
quip | dc1b59df9c | fix: compiler warnings for _C and _moe | 4 months ago
squeezellm | dc1b59df9c | fix: compiler warnings for _C and _moe | 4 months ago
quant_ops.h | 0e6c400b13 | feat: re-add GGUF (#600) | 4 months ago