AlpinDale 3f49a55f82 feat: add INT8 W8A16 quant for TPU (#663) 4 months ago
..
compressed_tensors f1e1d0bd3d feat: introduce `BaseAphroditeParameter` (#646) 4 months ago
gguf_utils 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
utils 3170c0d4c6 fix: GPTQ/AWQ on Colab (#655) 4 months ago
__init__.py 3f49a55f82 feat: add INT8 W8A16 quant for TPU (#663) 4 months ago
aqlm.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
autoquant.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
awq.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
awq_marlin.py 3170c0d4c6 fix: GPTQ/AWQ on Colab (#655) 4 months ago
base_config.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
bitsandbytes.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
deepspeedfp.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
eetq.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
exl2.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
fbgemm_fp8.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
fp8.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
gguf.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
gptq.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
gptq_marlin.py 3170c0d4c6 fix: GPTQ/AWQ on Colab (#655) 4 months ago
gptq_marlin_24.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
hadamard.safetensors 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
kv_cache.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
marlin.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
qqq.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
quip.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
quip_utils.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
schema.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
squeezellm.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
tpu_int8.py 3f49a55f82 feat: add INT8 W8A16 quant for TPU (#663) 4 months ago