Commit History

Author     SHA1        Message                                                            Date
sgsdxzy    214151b04c  fix: max_num_batched_tokens for chunked_prefill (#412)             9 months ago
sgsdxzy    6a0a6360f1  fix: Allow setting config-path when converting ggufs. (#410)       9 months ago
sgsdxzy    fcfb72af24  Support arbitrary model in GGUF. (#381)                            9 months ago
AlpinDale  e42a78381a  feat: switch from pylint to ruff (#322)                            10 months ago
sgsdxzy    fe7844f2ef  feat: sharding and safetensors support for gguf conversion (#256)  1 year ago
AlpinDale  c3a221eb02  feat: GGUF, QuIP#, and Marlin support (#228)                       1 year ago