.. |
compressed_tensors
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
4 meses atrás |
gguf_utils
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 meses atrás |
utils
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
4 meses atrás |
__init__.py
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
4 meses atrás |
aqlm.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
autoquant.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
awq.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
awq_marlin.py
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
4 meses atrás |
base_config.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
bitsandbytes.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
deepspeedfp.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
eetq.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
exl2.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
fbgemm_fp8.py
|
408ca43d2e
feat: support fbgemm_fp8 quant on ampere
|
4 meses atrás |
fp8.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
gguf.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
gptq.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
gptq_marlin.py
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
4 meses atrás |
gptq_marlin_24.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
hadamard.safetensors
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 meses atrás |
marlin.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
quip.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |
quip_utils.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 meses atrás |
schema.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 meses atrás |
squeezellm.py
|
9be43994fe
feat: fbgemm quantization support (#601)
|
4 meses atrás |