.. |
awq
|
5053743c1c
feat: speedup AWQ (#223)
|
11 months ago |
fp8_e5m2_kvcache
|
8e1cd54497
fix: do not include fp8 for rocm (#271)
|
10 months ago |
gguf
|
c3a221eb02
feat: GGUF, QuIP#, and Marlin support (#228)
|
11 months ago |
gptq
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 months ago |
marlin
|
aebd68c632
feat: backport kernels (#235)
|
11 months ago |
quip
|
aebd68c632
feat: backport kernels (#235)
|
11 months ago |
squeezellm
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 months ago |