Commit Verlauf

Autor SHA1 Nachricht Datum
  AlpinDale a4cbcfe59f feat: disable logprob serialization to CPU for spec decode vor 5 Monaten
  AlpinDale af43576da0 feat: add MLPSpeculator speculative decoding support (#572) vor 5 Monaten
  AlpinDale 4d1e613804 chore: minor simplifications vor 5 Monaten
  AlpinDale 5b0c11d190 support pipeline parallel pynccl groups vor 6 Monaten
  AlpinDale be8154a8a0 feat: proper embeddings API with e5-mistral-7b support vor 6 Monaten
  AlpinDale 79901b76de logprobs for target model (spec decoding) vor 6 Monaten
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) vor 8 Monaten
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) vor 10 Monaten