.. |
aqlm
|
705821a7fe
feat: AQLM quantization support (#293)
|
10 miesięcy temu |
awq
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 miesięcy temu |
bitsandbytes
|
a98babfb74
fix: bnb on Turing GPUs (#299)
|
10 miesięcy temu |
exl2
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 miesięcy temu |
fp8
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 miesięcy temu |
fp8_e5m2_kvcache
|
8e1cd54497
fix: do not include fp8 for rocm (#271)
|
10 miesięcy temu |
gguf
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 miesięcy temu |
gptq
|
8cf8fa3f09
don't assign meta tensor to a cuda param
|
7 miesięcy temu |
int8_kvcache
|
9810daa699
feat: INT8 KV Cache (#298)
|
10 miesięcy temu |
marlin
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 miesięcy temu |
quip
|
aebd68c632
feat: backport kernels (#235)
|
11 miesięcy temu |
squeezellm
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 miesięcy temu |
quant_ops.cpp
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 miesięcy temu |
quant_ops.h
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 miesięcy temu |