AlpinDale afb8cfe945 feat: add most of the EETQ kernels, prune later 7 months ago
..
aqlm 705821a7fe feat: AQLM quantization support (#293) 10 months ago
awq 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
bitsandbytes a98babfb74 fix: bnb on Turing GPUs (#299) 10 months ago
eetq afb8cfe945 feat: add most of the EETQ kernels, prune later 7 months ago
exl2 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
fp8 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
fp8_e5m2_kvcache 8e1cd54497 fix: do not include fp8 for rocm (#271) 10 months ago
gguf 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
gptq 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
int8_kvcache 9810daa699 feat: INT8 KV Cache (#298) 10 months ago
marlin 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
quip aebd68c632 feat: backport kernels (#235) 11 months ago
squeezellm 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 11 months ago
quant_ops.cpp 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
quant_ops.h 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago