提交历史

作者 SHA1 备注 提交日期
  AlpinDale 86bf2cc4f3 core: rename `PromptInputs,inputs` -> `PromptType,prompt` (#1080) 3 周之前
  AlpinDale 9a7d5514c4 feat: introduce MQAphroditeEngine (#1056) 1 月之前
  AlpinDale 638c08d9dc fix: clean shutdown issues (#1047) 1 月之前
  AlpinDale 05be6085ec core: factor out input preprocessing into a separate class (#1039) 1 月之前
  AlpinDale 055c8905a3 api: add sampling/engine option to return only deltas or final output (#1035) 1 月之前
  AlpinDale ddaefd8d38 chore: remove engine_use_ray (#1024) 1 月之前
  AlpinDale f561a54a43 core: fix async postprocessor in case of preemption (#1000) 1 月之前
  AlpinDale 55fa2b9c54 fix: raise exception when accessing logger for disable_log_stats=True case (#984) 1 月之前
  AlpinDale 09dab16f82 core: improve async postproc + multi-step performance (#983) 1 月之前
  AlpinDale b3f6eeb1d2 vlm: increase the default `max_num_batched_tokens` for multimodal models (#973) 1 月之前
  AlpinDale 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 月之前
  AlpinDale 34e8606e81 vlm: do not allow max_model_len overflow (#962) 1 月之前
  AlpinDale 5be6225f38 core: support multi-step scheduling w/ async post-processor (#955) 1 月之前
  AlpinDale 0e2bfccda0 core: add virtual engine for async outproc (#939) 1 月之前
  AlpinDale 15cb8d5c26 xpu: support pipeline parallel (#932) 1 月之前
  AlpinDale 8d9f1fd4e6 feat: add single user mode (#927) 1 月之前
  AlpinDale f7f3fed265 feat: add async postprocessor (#925) 1 月之前
  AlpinDale b1492c1529 core: add multi-step scheduling support for the synchronous engine (#914) 1 月之前
  AlpinDale 16e5b2be8b fix: empty prompt crashing the server (#912) 1 月之前
  AlpinDale 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 1 月之前
  AlpinDale 901900854e chore: consolidate environment variables within one file (#882) 1 月之前
  AlpinDale db6a50fd5c async: disable multi-step scheduling for sync engine (#880) 1 月之前
  AlpinDale 2242cb25dc fix: unbound tokenizer error 2 月之前
  AlpinDale 3d83e64f8e feat: add metrics for prefix cache hit rate (#829) 2 月之前
  AlpinDale 9d9722b1c1 fix: metrics endpoint with RPC server (#747) 4 月之前
  AlpinDale 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) 4 月之前
  AlpinDale 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) 4 月之前
  AlpinDale f76f2a5af0 feat: add aphrodite plugin system (#705) 4 月之前
  AlpinDale 7debd35ca2 fix: shut down ray dag workers cleanly (#692) 4 月之前
  AlpinDale bf88c8567e feat: mamba model support (#674) 4 月之前