Commit Verlauf

Autor SHA1 Nachricht Datum
  AlpinDale 86bf2cc4f3 core: rename `PromptInputs,inputs` -> `PromptType,prompt` (#1080) vor 3 Wochen
  AlpinDale 9a7d5514c4 feat: introduce MQAphroditeEngine (#1056) vor 1 Monat
  AlpinDale 638c08d9dc fix: clean shutdown issues (#1047) vor 1 Monat
  AlpinDale 05be6085ec core: factor out input preprocessing into a separate class (#1039) vor 1 Monat
  AlpinDale 055c8905a3 api: add sampling/engine option to return only deltas or final output (#1035) vor 1 Monat
  AlpinDale ddaefd8d38 chore: remove engine_use_ray (#1024) vor 1 Monat
  AlpinDale f561a54a43 core: fix async postprocessor in case of preemption (#1000) vor 1 Monat
  AlpinDale 55fa2b9c54 fix: raise exception when accessing logger for disable_log_stats=True case (#984) vor 1 Monat
  AlpinDale 09dab16f82 core: improve async postproc + multi-step performance (#983) vor 1 Monat
  AlpinDale b3f6eeb1d2 vlm: increase the default `max_num_batched_tokens` for multimodal models (#973) vor 1 Monat
  AlpinDale 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) vor 1 Monat
  AlpinDale 34e8606e81 vlm: do not allow max_model_len overflow (#962) vor 1 Monat
  AlpinDale 5be6225f38 core: support multi-step scheduling w/ async post-processor (#955) vor 1 Monat
  AlpinDale 0e2bfccda0 core: add virtual engine for async outproc (#939) vor 1 Monat
  AlpinDale 15cb8d5c26 xpu: support pipeline parallel (#932) vor 1 Monat
  AlpinDale 8d9f1fd4e6 feat: add single user mode (#927) vor 1 Monat
  AlpinDale f7f3fed265 feat: add async postprocessor (#925) vor 1 Monat
  AlpinDale b1492c1529 core: add multi-step scheduling support for the synchronous engine (#914) vor 1 Monat
  AlpinDale 16e5b2be8b fix: empty prompt crashing the server (#912) vor 1 Monat
  AlpinDale 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) vor 1 Monat
  AlpinDale 901900854e chore: consolidate environment variables within one file (#882) vor 1 Monat
  AlpinDale db6a50fd5c async: disable multi-step scheduling for sync engine (#880) vor 1 Monat
  AlpinDale 2242cb25dc fix: unbound tokenizer error vor 2 Monaten
  AlpinDale 3d83e64f8e feat: add metrics for prefix cache hit rate (#829) vor 2 Monaten
  AlpinDale 9d9722b1c1 fix: metrics endpoint with RPC server (#747) vor 4 Monaten
  AlpinDale 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) vor 4 Monaten
  AlpinDale 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) vor 4 Monaten
  AlpinDale f76f2a5af0 feat: add aphrodite plugin system (#705) vor 4 Monaten
  AlpinDale 7debd35ca2 fix: shut down ray dag workers cleanly (#692) vor 4 Monaten
  AlpinDale bf88c8567e feat: mamba model support (#674) vor 4 Monaten