AlpinDale 8d26cf3876 simplify model_executor logic 9 months ago
..
__init__.py bd0ddf1cfe feat: EETQ quantization (#408) 10 months ago
aqlm.py 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ 10 months ago
awq.py 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ 10 months ago
base_config.py aa244761ed formatting and typing 10 months ago
bitsandbytes.py fa083286e3 Speculative Decoding Part 4: Lookahead scheduling (#402) 10 months ago
eetq.py 8d26cf3876 simplify model_executor logic 9 months ago
exl2.py ea26c91e52 proper typing 10 months ago
gguf.py ea26c91e52 proper typing 10 months ago
gptq.py 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ 10 months ago
hadamard.safetensors c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) 1 year ago
marlin.py ea26c91e52 proper typing 10 months ago
quip.py ea26c91e52 proper typing 10 months ago
quip_utils.py e42a78381a feat: switch from pylint to ruff (#322) 11 months ago
schema.py 7528e0ce3e make detokenization optional 10 months ago
squeezellm.py ea26c91e52 proper typing 10 months ago