.. |
compressed_tensors
|
a33aaf3b42
chore: cleanup compressed tensors
|
hai 7 meses |
gguf_utils
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hai 10 meses |
__init__.py
|
517676249c
chore: update the compressed-tensors config
|
hai 7 meses |
aqlm.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
autoquant.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
awq.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
base_config.py
|
c66b1b57b1
Marlin 2:4 sparsity (#555)
|
hai 7 meses |
bitsandbytes.py
|
690110a051
feat: bitsandbytes quantization
|
hai 7 meses |
deepspeedfp.py
|
4acf34417a
feat: add DeepSpeedFP quantization for all models
|
hai 7 meses |
eetq.py
|
b178ae4b4a
chore: generalize linear_method to be quant_method (#540)
|
hai 8 meses |
exl2.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
fp8.py
|
7e54c3916d
chore: factor out epilogues from cutlass kernels
|
hai 7 meses |
gguf.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
gptq.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
gptq_marlin.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
gptq_marlin_24.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
hadamard.safetensors
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hai 10 meses |
marlin.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
quip.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |
quip_utils.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hai 10 meses |
schema.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hai 10 meses |
squeezellm.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
hai 7 meses |