AlpinDale 73177656ed feat: quant_llm support (#755) vor 4 Monaten
..
compressed_tensors 04da8c33bd Revert "chore: use the `compressed-tensors` library to avoid code reuse (#704)" (#706) vor 4 Monaten
gguf_utils 9d81716bfd [v0.5.3] Release Candidate (#388) vor 8 Monaten
utils 73177656ed feat: quant_llm support (#755) vor 4 Monaten
__init__.py 73177656ed feat: quant_llm support (#755) vor 4 Monaten
aqlm.py ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) vor 4 Monaten
autoquant.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
awq.py edec2e9a9e feat: migrate awq and awq_marlin to AphroditeParameter (#702) vor 4 Monaten
awq_marlin.py edec2e9a9e feat: migrate awq and awq_marlin to AphroditeParameter (#702) vor 4 Monaten
base_config.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
bitsandbytes.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
deepspeedfp.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
eetq.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
exl2.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
experts_int8.py d34e083c48 feat: add experts_int8 support (#730) vor 4 Monaten
fbgemm_fp8.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
fp6.py 73177656ed feat: quant_llm support (#755) vor 4 Monaten
fp8.py d34e083c48 feat: add experts_int8 support (#730) vor 4 Monaten
gguf.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
gptq.py ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) vor 4 Monaten
gptq_marlin.py 4f6020cc86 chore: migrate gptq_marlin to AphroditeParameters (#699) vor 4 Monaten
gptq_marlin_24.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
hadamard.safetensors 9d81716bfd [v0.5.3] Release Candidate (#388) vor 8 Monaten
kv_cache.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
marlin.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
qqq.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
quip.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
quip_utils.py 9d81716bfd [v0.5.3] Release Candidate (#388) vor 8 Monaten
schema.py 9d81716bfd [v0.5.3] Release Candidate (#388) vor 8 Monaten
squeezellm.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
tpu_int8.py 3f49a55f82 feat: add INT8 W8A16 quant for TPU (#663) vor 4 Monaten