Commit History

Author SHA1 Message Date
  g4rg 2aab3da9bd chore: fix Python 3.8+ compatibility (#170) 1 year ago
  AlpinDale 7c6fdea535 fix: GPTQ warnings and exllama states (#171) 1 year ago
  AlpinDale f5f9bc6a7c fix: memory profiling (#166) 1 year ago
  AlpinDale 730357c7d5 chore: implement lazy module loader for models (#165) 1 year ago
  AlpinDale 35e9cf707c chore: force pt for mixtral (#164) 1 year ago
  AlpinDale ae57df0f44 fix: sliding window for mistral/mixtral (#163) 1 year ago
  AlpinDale 844aec2544 fix: prompt logprobs (#162) 1 year ago
  AlpinDale 02f3ab3501 fix: replace head_mapping with num_kv_heads (#161) 1 year ago
  AlpinDale a92f63d8c0 fix: OpenAI chat - reference before assignment (#160) 1 year ago
  AlpinDale dc5aedcfa7 fix: llama keyerror (#159) 1 year ago
  AlpinDale 980673ffb7 fix: fractional gpus (#157) 1 year ago
  AlpinDale 4ec78676b9 feat: implement Deci-LM (#158) 1 year ago
  AlpinDale ef04ab5c4d chore: implement torch native layers (#156) 1 year ago
  AlpinDale cb9beb32b0 merge main 1 year ago
  AlpinDale b9df8734d1 remove megablocks as a dependency 1 year ago
  AlpinDale 23a54389c8 bump torch 1 year ago
  AlpinDale 4a016fcbd9 limit number of processes to prevent OOM in CI 1 year ago
  AlpinDale 282675e4a1 bump version to 0.4.3 1 year ago
  AlpinDale 2755a48d51 merge dev branch into main (#153) 1 year ago
  AlpinDale 7dfcac677d update readme 1 year ago
  AlpinDale 87277c76e4 feat: Mixtral 8x7B support (#155) 1 year ago
  AlpinDale 81e7981dce feat: add prometheus production metrics (#154) 1 year ago
  AlpinDale 62b2c4119d feat: re-write GPTQ and refactor exllama kernels (#152) 1 year ago
  AlpinDale 8ed7d56305 feat: OpenAI chat completions templates (#138) 1 year ago
  AlpinDale 653da510d1 chore: rewrite InputMetadata (#143) 1 year ago
  AlpinDale 5dbd5f8c30 fix: quant TP (#129) 1 year ago
  AlpinDale 1334a833a4 feat: AMD ROCm support (#95) 1 year ago
  AlpinDale 63c28919a0 Revert "fix: correct auto ntk scaling_factor for 4k ctx case" (#149) 1 year ago
  g4rg 2c5b0268a4 chore: KoboldAI/koboldcpp updates (#48) 1 year ago
  AlpinDale e386032ae8 fix: rope duplication (#142) 1 year ago