Commit History

Author SHA1 Message Date
  AlpinDale 4599c98f99 feat: dynamic image size support for VLMs 7 months ago
  AlpinDale 5be90c3859 Mamba infrastrucuture support (#586) 7 months ago
  AlpinDale ae04f57ec1 feat: Pipeline Parallel support (#581) 7 months ago
  AlpinDale cdff8e89f9 feat: introduce `DraftModelRunner` 7 months ago
  AlpinDale 405bb74612 Control plane comms refactor (#573) 7 months ago
  AlpinDale 8d77c69cbd feat: support image processor and add llava example 7 months ago
  AlpinDale e4ea3da1ad fix: tensor parallel with embedding model 7 months ago
  AlpinDale de62ceb18c refactor: eliminate parallel worker per-step task scheduling overhead 8 months ago
  AlpinDale a94de94c44 refactor: combine the prefill and decode into a single API (#553) 8 months ago
  AlpinDale 50b7c13db0 refactor: attention selector (#552) 8 months ago
  AlpinDale be8154a8a0 feat: proper embeddings API with e5-mistral-7b support 8 months ago