compressed_tensors | 201db10f02 models: add support for Phi3 MoE | 2 weeks ago |
gguf_utils | 8a71788372 Add OLMoE (#772) | 2 months ago |
kernels | f7f3fed265 feat: add async postprocessor (#925) | 2 weeks ago |
utils | 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) | 1 month ago |
__init__.py | f98e7b2f8c feat: add HQQ quantization support (#795) | 2 months ago |
aqlm.py | ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) | 3 months ago |
autoquant.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
awq.py | edec2e9a9e feat: migrate awq and awq_marlin to AphroditeParameter (#702) | 4 months ago |
awq_marlin.py | 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) | 1 month ago |
awq_triton.py | fcfcfc65e1 quants: add triton kernels for AWQ (#946) | 2 weeks ago |
base_config.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
bitsandbytes.py | 6bdff60aab quant: support pre-quanted bitsandbytes checkpoints (#961) | 2 weeks ago |
deepspeedfp.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
eetq.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
exl2.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
experts_int8.py | 201db10f02 models: add support for Phi3 MoE | 2 weeks ago |
fbgemm_fp8.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
fp6.py | 73177656ed feat: quant_llm support (#755) | 3 months ago |
fp8.py | 201db10f02 models: add support for Phi3 MoE | 2 weeks ago |
gguf.py | 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) | 2 weeks ago |
gptq.py | ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) | 3 months ago |
gptq_marlin.py | 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) | 1 month ago |
gptq_marlin_24.py | 5d9021969c quants: update `qqq` and `gptq_marlin_24` to use AphroditeParameters (#921) | 3 weeks ago |
hadamard.safetensors | 9d81716bfd [v0.5.3] Release Candidate (#388) | 8 months ago |
hqq_marlin.py | f98e7b2f8c feat: add HQQ quantization support (#795) | 2 months ago |
kv_cache.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
marlin.py | 799667737b quantization: update marlin to use `AphroditeParameters` (#913) | 3 weeks ago |
qqq.py | 5d9021969c quants: update `qqq` and `gptq_marlin_24` to use AphroditeParameters (#921) | 3 weeks ago |
quip.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
quip_utils.py | 8a71788372 Add OLMoE (#772) | 2 months ago |
schema.py | 9d81716bfd [v0.5.3] Release Candidate (#388) | 8 months ago |
squeezellm.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
tpu_int8.py | f4b62bf803 quant: update tpu_int8 to use AphroditeParameters (#959) | 2 weeks ago |