.. |
compressed_tensors
|
f1e1d0bd3d
feat: introduce `BaseAphroditeParameter` (#646)
|
4 months ago |
gguf_utils
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
utils
|
3170c0d4c6
fix: GPTQ/AWQ on Colab (#655)
|
4 months ago |
__init__.py
|
3f49a55f82
feat: add INT8 W8A16 quant for TPU (#663)
|
4 months ago |
aqlm.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
autoquant.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
awq.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
awq_marlin.py
|
3170c0d4c6
fix: GPTQ/AWQ on Colab (#655)
|
4 months ago |
base_config.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
bitsandbytes.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
deepspeedfp.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
eetq.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
exl2.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
fbgemm_fp8.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
fp8.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
gguf.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
gptq.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
gptq_marlin.py
|
3170c0d4c6
fix: GPTQ/AWQ on Colab (#655)
|
4 months ago |
gptq_marlin_24.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
hadamard.safetensors
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
kv_cache.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
marlin.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
qqq.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
quip.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
quip_utils.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
schema.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
squeezellm.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
tpu_int8.py
|
3f49a55f82
feat: add INT8 W8A16 quant for TPU (#663)
|
4 months ago |