AlpinDale 201db10f02 models: add support for Phi3 MoE há 2 semanas atrás
..
compressed_tensors 201db10f02 models: add support for Phi3 MoE há 2 semanas atrás
gguf_utils 8a71788372 Add OLMoE (#772) há 2 meses atrás
kernels f7f3fed265 feat: add async postprocessor (#925) há 2 semanas atrás
utils 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) há 1 mês atrás
__init__.py f98e7b2f8c feat: add HQQ quantization support (#795) há 2 meses atrás
aqlm.py ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) há 3 meses atrás
autoquant.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
awq.py edec2e9a9e feat: migrate awq and awq_marlin to AphroditeParameter (#702) há 4 meses atrás
awq_marlin.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) há 1 mês atrás
awq_triton.py fcfcfc65e1 quants: add triton kernels for AWQ (#946) há 2 semanas atrás
base_config.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
bitsandbytes.py 6bdff60aab quant: support pre-quanted bitsandbytes checkpoints (#961) há 2 semanas atrás
deepspeedfp.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
eetq.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
exl2.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
experts_int8.py 201db10f02 models: add support for Phi3 MoE há 2 semanas atrás
fbgemm_fp8.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
fp6.py 73177656ed feat: quant_llm support (#755) há 3 meses atrás
fp8.py 201db10f02 models: add support for Phi3 MoE há 2 semanas atrás
gguf.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) há 2 semanas atrás
gptq.py ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) há 3 meses atrás
gptq_marlin.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) há 1 mês atrás
gptq_marlin_24.py 5d9021969c quants: update `qqq` and `gptq_marlin_24` to use AphroditeParameters (#921) há 3 semanas atrás
hadamard.safetensors 9d81716bfd [v0.5.3] Release Candidate (#388) há 8 meses atrás
hqq_marlin.py f98e7b2f8c feat: add HQQ quantization support (#795) há 2 meses atrás
kv_cache.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
marlin.py 799667737b quantization: update marlin to use `AphroditeParameters` (#913) há 3 semanas atrás
qqq.py 5d9021969c quants: update `qqq` and `gptq_marlin_24` to use AphroditeParameters (#921) há 3 semanas atrás
quip.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
quip_utils.py 8a71788372 Add OLMoE (#772) há 2 meses atrás
schema.py 9d81716bfd [v0.5.3] Release Candidate (#388) há 8 meses atrás
squeezellm.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
tpu_int8.py f4b62bf803 quant: update tpu_int8 to use AphroditeParameters (#959) há 2 semanas atrás