Commit History

Author SHA1 Message Date
  AlpinDale 9183c13b5b add gumbel softmax 11 months ago
  AlpinDale b3315c9a4c update the readme (#207) 11 months ago
  AlpinDale 9f77f35ff5 bump version to 0.4.6 (#204) 1 year ago
  AlpinDale fe70c6e8d5 feat: bump cuda and pytorch (#205) 1 year ago
  AlpinDale c5802b2bd5 fix: remove windows specific files 1 year ago
  AlpinDale 193287b2ef fix: mixtral unused import 1 year ago
  AlpinDale 53d391e1f2 merge 'dev' into 'main' 1 year ago
  AlpinDale e1f3fd1e02 fix: test units (#201) 1 year ago
  AlpinDale d54791aaa8 feat: reduce sampler overhead by making it less blocking (#198) 1 year ago
  AlpinDale 871c0ce8e4 fix: triton compile error (#200) 1 year ago
  AlpinDale 7e72ce0a73 feat: mixtral tensor parallelism (#193) 1 year ago
  AlpinDale d7f113c3ff readme: add acknowledgements 1 year ago
  AlpinDale 95bdd35ec9 feat: rejection sampler (#197) 1 year ago
  AlpinDale f121a5edd8 feat: tokenizer endpoint for OpenAI API (#195) 1 year ago
  AlpinDale 15a0454172 feat: FP8 KV Cache (#185) 1 year ago
  AlpinDale 801eda0b7a feat: support GPTQ 2, 3, and 8bit quants (#181) 1 year ago
  AlpinDale b9b295d74e chore: backlogs 1 (#191) 1 year ago
  AlpinDale 17cdc5ac23 yapf 1 year ago
  KaraKaraWitch 9a0b5a197d fix: set CPU Affinity (#187) 1 year ago
  AlpinDale 68c2083adb fix includes in wheel 1 year ago
  AlpinDale 11af9b796e bump version to 0.4.5 1 year ago
  AlpinDale 3736d831f2 fix klite embed 1 year ago
  AlpinDale 81d7a8d323 bump version to 0.4.4 1 year ago
  AlpinDale f013d714c0 chore: merge dev branch into main (#177) 1 year ago
  AlpinDale 6eb0b926fd chore: make openai api key optional (#176) 1 year ago
  g4rg fe57bb7ad2 feat: add rope scaling to mixtral (#174) 1 year ago
  AlpinDale 7d91e9e0f2 feat: CUDA graphs (#172) 1 year ago
  AlpinDale 725be3e0de feat: mixtral HF with expert parallelism (#167) 1 year ago
  AlpinDale 6c50f5b067 chore: include stop strings in output (#168) 1 year ago
  g4rg 2aab3da9bd chore: fix Python 3.8+ compatibility (#170) 1 year ago