Commit History

Author SHA1 Message Date
  AlpinDale 99680b2d23 feat: soft prompts (#589) 5 months ago
  AlpinDale 4f7d212b70 feat: remove vision language config 5 months ago
  AlpinDale 4599c98f99 feat: dynamic image size support for VLMs 5 months ago
  AlpinDale 5be90c3859 Mamba infrastructure support (#586) 5 months ago
  AlpinDale ae04f57ec1 feat: Pipeline Parallel support (#581) 5 months ago
  AlpinDale cdff8e89f9 feat: introduce `DraftModelRunner` 5 months ago
  AlpinDale c0c336aaa3 refactor: registry for processing model inputs; quick_gelu; clip model support 5 months ago
  AlpinDale 426a13ab73 fix: pass multi_modal_kwargs to CPU model runner 5 months ago
  AlpinDale 405bb74612 Control plane comms refactor (#573) 5 months ago
  AlpinDale fdabb55a4d fix: wrong multi_modal_input format for CPU 5 months ago
  AlpinDale 8d77c69cbd feat: support image processor and add llava example 5 months ago
  AlpinDale f6250c5516 move dockerfiles to root; fix cpu build 6 months ago
  AlpinDale a94de94c44 refactor: combine the prefill and decode into a single API (#553) 6 months ago
  AlpinDale 50b7c13db0 refactor: attention selector (#552) 6 months ago
  AlpinDale 0e062e66d3 set block size at init 6 months ago
  AlpinDale 35ae01d7ba refactor: attention metadata term 6 months ago
  AlpinDale aed64884c6 feat: prompt logprobs with chunked prefill (#539) 6 months ago
  AlpinDale fca911ee0a vLLM Upstream Sync (#526) 6 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago