Historique des commits

Auteur SHA1 Message Date
  AlpinDale ac79d115b3 add guards for prefix caching, fp8, chunked, etc il y a 5 mois
  AlpinDale 656459fd84 make fp8_e4m3 work on nvidia il y a 5 mois
  AlpinDale 60e74e92fd add rope_scaling arg il y a 5 mois
  AlpinDale 9e73559eba make use of batched rotary embedding kernels to support long context lora il y a 5 mois
  AlpinDale c66b1b57b1 Marlin 2:4 sparsity (#555) il y a 5 mois
  AlpinDale 7bcff4ac03 implement sharded state dict il y a 5 mois
  AlpinDale 13e5ffd456 fix distributed_executor_backend in args il y a 5 mois
  AlpinDale c6a501f682 add multiprocessing executor; make ray optional il y a 5 mois
  AlpinDale e42d0b3455 possibly improve ngram efficiency il y a 5 mois
  AlpinDale be8154a8a0 feat: proper embeddings API with e5-mistral-7b support il y a 5 mois
  AlpinDale 4acf34417a feat: add DeepSpeedFP quantization for all models il y a 5 mois
  AlpinDale 197a6d2c16 auto disable speculative decoding by the running queue size il y a 5 mois
  AlpinDale 4476d2d885 remove cuda version check il y a 5 mois
  AlpinDale 2351a0e2cd feat: FlashInfer backend for decoding phase (#548) il y a 5 mois
  AlpinDale 35ae01d7ba refactor: attention metadata term il y a 5 mois
  AlpinDale 723c6acb84 re-add ngram speculative decoding il y a 5 mois
  AlpinDale f22b700ee4 feat: marlin kernels for GPTQ (#547) il y a 5 mois
  AlpinDale 110a2724f4 extended -> llama3, also make rope_type in config work il y a 5 mois
  AlpinDale e87c32bed3 feat: full tensor parallel for LoRA layers (#545) il y a 5 mois
  AlpinDale 3ab36e6b2d feat: extended RoPE for Llama 3.1 (#543) il y a 5 mois
  AlpinDale e7b1368156 feat: Phi3 support il y a 6 mois
  AlpinDale 46159b107a formatting: pt1 il y a 6 mois
  AlpinDale fca911ee0a vLLM Upstream Sync (#526) il y a 6 mois
  AlpinDale 42998e423c better quant verification il y a 7 mois
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) il y a 8 mois
  AlpinDale 78d66f16d1 Chunked Prefill Part 1 (#384) il y a 9 mois
  AlpinDale feb5840f2a feat: async tokenization (#374) il y a 9 mois
  AlpinDale 29c241c115 fix: explicitly disallow installation on non-linux platforms (#373) il y a 9 mois
  AlpinDale 97a2b26c97 fix: assertion error when use_sliding_window is present il y a 9 mois
  AlpinDale 0f6d56b07f feat: model executor refactor (#367) il y a 9 mois