AlpinDale 9810daa699 feat: INT8 KV Cache (#298) 1 year ago
..
aqlm 705821a7fe feat: AQLM quantization support (#293) 1 year ago
awq 5053743c1c feat: speedup AWQ (#223) 1 year ago
bitsandbytes 82955ba440 fix: backport bnb kernels (#297) 1 year ago
fp8_e5m2_kvcache 8e1cd54497 fix: do not include fp8 for rocm (#271) 1 year ago
gguf c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) 1 year ago
gptq 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 1 year ago
int8_kvcache 9810daa699 feat: INT8 KV Cache (#298) 1 year ago
marlin 72229a94da feat: better marlin kernels (#285) 1 year ago
quip aebd68c632 feat: backport kernels (#235) 1 year ago
squeezellm 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 1 year ago