Commit Verlauf

Autor SHA1 Nachricht Datum
  AlpinDale 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) vor 1 Monat
  AlpinDale 5be6225f38 core: support multi-step scheduling w/ async post-processor (#955) vor 1 Monat
  AlpinDale 0e2bfccda0 core: add virtual engine for async outproc (#939) vor 1 Monat
  AlpinDale 15cb8d5c26 xpu: support pipeline parallel (#932) vor 1 Monat
  AlpinDale f7f3fed265 feat: add async postprocessor (#925) vor 1 Monat
  AlpinDale b1492c1529 core: add multi-step scheduling support for the synchronous engine (#914) vor 1 Monat
  AlpinDale 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) vor 1 Monat
  AlpinDale 901900854e chore: consolidate environment variables within one file (#882) vor 1 Monat
  AlpinDale b5aa11020b api: fix crashes under very high loads (#878) vor 1 Monat
  AlpinDale 5bd4473bb6 async: avoid premature exit in the async generator (#872) vor 1 Monat
  AlpinDale 48a8693aed feat: multi-step scheduling (#831) vor 2 Monaten
  AlpinDale dcb794a340 fix: revert incorrect commit vor 3 Monaten
  AlpinDale 76367b5ae7 wip vor 3 Monaten
  AlpinDale 9d9722b1c1 fix: metrics endpoint with RPC server (#747) vor 4 Monaten
  AlpinDale 7debd35ca2 fix: shut down ray dag workers cleanly (#692) vor 4 Monaten
  AlpinDale 19ad952dd4 chore: better stream termination in async engine (#672) vor 4 Monaten
  AlpinDale 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) vor 4 Monaten
  AlpinDale ed9a6f97c1 fix: kill api server when pinging dead engine (#660) vor 4 Monaten
  AlpinDale 77c4fbd5c9 fix: better async request cancellation (#641) vor 4 Monaten
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) vor 8 Monaten
  AlpinDale 0f6d56b07f feat: model executor refactor (#367) vor 9 Monaten
  AlpinDale b361096463 fix: tokenizer when using ray (#366) vor 9 Monaten
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) vor 9 Monaten
  AlpinDale c2d77b1822 chore: logging refactor (#302) vor 10 Monaten
  AlpinDale 2d3d44b3e9 chore: add health check for ray workers (#290) vor 10 Monaten
  AlpinDale ac82b67f75 feat: naive context shift and various QoL changes (#289) vor 10 Monaten
  AlpinDale 657aec0cbd refactor: OpenAI endpoint (#261) vor 10 Monaten
  AlpinDale 6a63ab4ec3 fix: remote encode request if using ray (#270) vor 11 Monaten
  AlpinDale c0146ed00e chore: slight refactor for async engine finish (#225) vor 1 Jahr