AlpinDale e2dbe5f05c feat: add sparse marlin for compressed tensors 7 meses atrás
..
compressed_tensors e2dbe5f05c feat: add sparse marlin for compressed tensors 7 meses atrás
gguf_utils 9d81716bfd [v0.5.3] Release Candidate (#388) 10 meses atrás
__init__.py 517676249c chore: update the compressed-tensors config 7 meses atrás
aqlm.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
autoquant.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
awq.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
base_config.py c66b1b57b1 Marlin 2:4 sparsity (#555) 8 meses atrás
bitsandbytes.py 690110a051 feat: bitsandbytes quantization 7 meses atrás
deepspeedfp.py 4acf34417a feat: add DeepSpeedFP quantization for all models 8 meses atrás
eetq.py b178ae4b4a chore: generalize linear_method to be quant_method (#540) 8 meses atrás
exl2.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
fp8.py 7e54c3916d chore: factor out epilogues from cutlass kernels 7 meses atrás
gguf.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
gptq.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
gptq_marlin.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
gptq_marlin_24.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
hadamard.safetensors 9d81716bfd [v0.5.3] Release Candidate (#388) 10 meses atrás
marlin.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
quip.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás
quip_utils.py 9d81716bfd [v0.5.3] Release Candidate (#388) 10 meses atrás
schema.py 9d81716bfd [v0.5.3] Release Candidate (#388) 10 meses atrás
squeezellm.py 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 7 meses atrás