Commit History

Author SHA1 Message Date
  AlpinDale 26a717b49f fix: use head_dim if available 1 year ago
  AlpinDale 5053743c1c feat: speedup AWQ (#223) 1 year ago
  AlpinDale c0aac15421 feat: S-LoRA support (#222) 1 year ago
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 1 year ago
  AlpinDale 3188d5690c fix: logprobs at -inf (#219) 1 year ago
  AlpinDale a39eeb7188 fix: logprobs for dynatemp (#215) 1 year ago
  Stefan Gligorijevic 9e7e108dc8 chore: clamp dynatemp_min (#214) 1 year ago
  AlpinDale 60f072ff6f chore: update klite embed and kcpp version (#212) 1 year ago
  AlpinDale 97f37c1cb2 fix: use tensor parallel for quantized mixtral (#213) 1 year ago
  Stefan Gligorijevic 56446a04bb feat: dynamic temperature (#209) 1 year ago
  AlpinDale 1394eab8ab fix temperature being set to 1 in all cases (#210) 1 year ago
  AlpinDale b3315c9a4c update the readme (#207) 1 year ago
  AlpinDale 9f77f35ff5 bump version to 0.4.6 (#204) 1 year ago
  AlpinDale fe70c6e8d5 feat: bump cuda and pytorch (#205) 1 year ago
  AlpinDale c5802b2bd5 fix: remove windows specific files 1 year ago
  AlpinDale 193287b2ef fix: mixtral unused import 1 year ago
  AlpinDale 53d391e1f2 merge 'dev' into 'main' 1 year ago
  AlpinDale e1f3fd1e02 fix: test units (#201) 1 year ago
  AlpinDale d54791aaa8 feat: reduce sampler overhead by making it less blocking (#198) 1 year ago
  AlpinDale 871c0ce8e4 fix: triton compile error (#200) 1 year ago
  AlpinDale 7e72ce0a73 feat: mixtral tensor parallelism (#193) 1 year ago
  AlpinDale d7f113c3ff readme: add acknowledgements 1 year ago
  AlpinDale 95bdd35ec9 feat: rejection sampler (#197) 1 year ago
  AlpinDale f121a5edd8 feat: tokenizer endpoint for OpenAI API (#195) 1 year ago
  AlpinDale 15a0454172 feat: FP8 KV Cache (#185) 1 year ago
  AlpinDale 801eda0b7a feat: support GPTQ 2, 3, and 8bit quants (#181) 1 year ago
  AlpinDale b9b295d74e chore: backlogs 1 (#191) 1 year ago
  AlpinDale 17cdc5ac23 yapf 1 year ago
  KaraKaraWitch 9a0b5a197d fix: set CPU Affinity (#187) 1 year ago
  AlpinDale 68c2083adb fix includes in wheel 1 year ago