AlpinDale 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 weeks ago
compressed_tensors 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
gguf_utils 8a71788372 Add OLMoE (#772) 2 months ago
kernels 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
utils 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
__init__.py f98e7b2f8c feat: add HQQ quantization support (#795) 2 months ago
aqlm.py ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) 3 months ago
autoquant.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
awq.py edec2e9a9e feat: migrate awq and awq_marlin to AphroditeParameter (#702) 4 months ago
awq_marlin.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
base_config.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
bitsandbytes.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
deepspeedfp.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
eetq.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
exl2.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
experts_int8.py d34e083c48 feat: add experts_int8 support (#730) 3 months ago
fbgemm_fp8.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
fp6.py 73177656ed feat: quant_llm support (#755) 3 months ago
fp8.py 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 weeks ago
gguf.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
gptq.py ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) 3 months ago
gptq_marlin.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
gptq_marlin_24.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
hadamard.safetensors 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
hqq_marlin.py f98e7b2f8c feat: add HQQ quantization support (#795) 2 months ago
kv_cache.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
marlin.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
qqq.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
quip.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
quip_utils.py 8a71788372 Add OLMoE (#772) 2 months ago
schema.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
squeezellm.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
tpu_int8.py 3f49a55f82 feat: add INT8 W8A16 quant for TPU (#663) 4 months ago