AlpinDale 071269e406 feat: FP8 E4M3 KV Cache (#405) il y a 9 mois
..
aqlm 705821a7fe feat: AQLM quantization support (#293) il y a 10 mois
awq 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ il y a 9 mois
bitsandbytes a98babfb74 fix: bnb on Turing GPUs (#299) il y a 10 mois
exl2 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ il y a 9 mois
fp8 071269e406 feat: FP8 E4M3 KV Cache (#405) il y a 9 mois
fp8_e5m2_kvcache 8e1cd54497 fix: do not include fp8 for rocm (#271) il y a 11 mois
gguf 89c32b40ec chore: add new imatrix quants (#320) il y a 10 mois
gptq 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ il y a 9 mois
int8_kvcache 9810daa699 feat: INT8 KV Cache (#298) il y a 10 mois
marlin 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ il y a 9 mois
quip aebd68c632 feat: backport kernels (#235) il y a 11 mois
squeezellm 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) il y a 1 an