Commit History

Author SHA1 Message Date
  AlpinDale a94de94c44 refactor: combine the prefill and decode into a single API (#553) 6 months ago
  AlpinDale 342346afda improve hashing function 6 months ago
  AlpinDale be8154a8a0 feat: proper embeddings API with e5-mistral-7b support 6 months ago
  AlpinDale 197a6d2c16 auto disable speculative decoding by the running queue size 6 months ago
  AlpinDale 8b56dc4347 dict -> torch.Tensor for blocks_to_swap 6 months ago
  AlpinDale 21ce19b3ea blocks_to_copy dict -> torch.Tensor 6 months ago
  AlpinDale ef733aee43 implement ExecuteModelData to reduce executor complexity 6 months ago
  AlpinDale 79901b76de logprobs for target model (spec decoding) 6 months ago
  AlpinDale 2351a0e2cd feat: FlashInfer backend for decoding phase (#548) 6 months ago
  AlpinDale b1555eb208 add new grafana metrics 6 months ago
  AlpinDale aed64884c6 feat: prompt logprobs with chunked prefill (#539) 6 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
  AlpinDale 9181fa0396 feat: Triton kernels for sampling (#383) 9 months ago
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 months ago
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
  AlpinDale 9810daa699 feat: INT8 KV Cache (#298) 10 months ago
  AlpinDale ac82b67f75 feat: naive context shift and various QoL changes (#289) 10 months ago
  AlpinDale 657aec0cbd refactor: OpenAI endpoint (#261) 10 months ago
  AlpinDale 4d04ade9ef feat: fine-grained seeds (#279) 11 months ago
  AlpinDale d2db4143fa feat: add grafana for metrics (#240) 11 months ago
  AlpinDale c0aac15421 feat: S-LoRA support (#222) 1 year ago
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 1 year ago
  AlpinDale f013d714c0 chore: merge dev branch into main (#177) 1 year ago
  AlpinDale 2755a48d51 merge dev branch into main (#153) 1 year ago
  50h100a fa0ae5a2c9 feat: new mirostatv2 implementation (#96) 1 year ago
  AlpinDale efc6f7fbec chore: reformats (#90) 1 year ago
  AlpinDale e6be0118c9 feat: prompt logprobs and batched samplers (#77) 1 year ago
  AlpinDale 75c27d3e65 massive overhaul 1 year ago
  AlpinDale 6dfca14e1f compute logprobs with log_softmax instead of log 1 year ago
  AlpinDale 6b9561ef07 adapt TGI incremental detokenization 1 year ago