sgsdxzy
|
214151b04c
fix: max_num_batched_tokens for chunked_prefill (#412)
|
9 mesiacov pred |
sgsdxzy
|
6a0a6360f1
fix: Allow setting config-path when converting ggufs. (#410)
|
9 mesiacov pred |
sgsdxzy
|
fcfb72af24
Support arbitrary model in GGUF. (#381)
|
9 mesiacov pred |
AlpinDale
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
10 mesiacov pred |
sgsdxzy
|
fe7844f2ef
feat: sharding and safetensors support for gguf conversion (#256)
|
1 rok pred |
AlpinDale
|
c3a221eb02
feat: GGUF, QuIP#, and Marlin support (#228)
|
1 rok pred |