AlpinDale f7f3fed265 feat: add async postprocessor (#925) 3 weeks ago
..
compressed_tensors 5cb2e998d8 quants: update compressed tensors lifecycle to remove `prefix` from `create_weights` (#924) 3 weeks ago
gguf_utils 8a71788372 Add OLMoE (#772) 2 months ago
kernels f7f3fed265 feat: add async postprocessor (#925) 3 weeks ago
utils 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
__init__.py f98e7b2f8c feat: add HQQ quantization support (#795) 2 months ago
aqlm.py ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) 3 months ago
autoquant.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
awq.py edec2e9a9e feat: migrate awq and awq_marlin to AphroditeParameter (#702) 4 months ago
awq_marlin.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
base_config.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
bitsandbytes.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
deepspeedfp.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
eetq.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
exl2.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
experts_int8.py d34e083c48 feat: add experts_int8 support (#730) 3 months ago
fbgemm_fp8.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
fp6.py 73177656ed feat: quant_llm support (#755) 3 months ago
fp8.py afc9a28aa0 chore: add AphroditeParameter support for FP8 quant (#902) 3 weeks ago
gguf.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
gptq.py ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) 3 months ago
gptq_marlin.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
gptq_marlin_24.py 5d9021969c quants: update `qqq` and `gptq_marlin_24` to use AphroditeParameters (#921) 3 weeks ago
hadamard.safetensors 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
hqq_marlin.py f98e7b2f8c feat: add HQQ quantization support (#795) 2 months ago
kv_cache.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
marlin.py 799667737b quantization: update marlin to use `AphroditeParameters` (#913) 3 weeks ago
qqq.py 5d9021969c quants: update `qqq` and `gptq_marlin_24` to use AphroditeParameters (#921) 3 weeks ago
quip.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
quip_utils.py 8a71788372 Add OLMoE (#772) 2 months ago
schema.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
squeezellm.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
tpu_int8.py 3f49a55f82 feat: add INT8 W8A16 quant for TPU (#663) 4 months ago