Commit History

Author SHA1 Message Date
  AlpinDale 0e6c400b13 feat: re-add GGUF (#600) 4 months ago
  AlpinDale 9be43994fe feat: fbgemm quantization support (#601) 4 months ago
  AlpinDale 5289c14b24 feat: Asymmetric Tensor Parallel (#594) 4 months ago
  AlpinDale 0f4a9ee77b quantized lm_head (#582) 4 months ago
  AlpinDale ecd4460d55 fix: support 2D inputs for embeddings 5 months ago
  AlpinDale 6a57861fca feat: initial XPU support via intel_extension_for_pytorch (#571) 5 months ago
  AlpinDale c975bba905 fix: sharded state loader with lora 5 months ago
  AlpinDale 6fc1ec6e9a fix redirects and improve low level debugging 5 months ago
  AlpinDale fca911ee0a vLLM Upstream Sync (#526) 6 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
  AlpinDale 968bde81bf fix: tensor parallel with GPTQ and AWQ quants (#307) 10 months ago
  AlpinDale c41462cfcd feat: exllamav2 quantization (#305) 10 months ago
  AlpinDale 705821a7fe feat: AQLM quantization support (#293) 10 months ago
  TearGosling 80e8a14949 feat: add pygchat Jinja template (#218) 11 months ago
  AlpinDale 8635901c76 fix: s-lora vocab embeddings 11 months ago
  AlpinDale ea0f57b233 feat: allow further support for non-cuda devices (#247) 11 months ago
  AlpinDale c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) 11 months ago
  AlpinDale c0aac15421 feat: S-LoRA support (#222) 11 months ago
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 11 months ago
  AlpinDale 2755a48d51 merge dev branch into main (#153) 1 year ago