AlpinDale
|
99680b2d23
feat: soft prompts (#589)
|
5 months ago |
AlpinDale
|
4f7d212b70
feat: remove vision language config
|
5 months ago |
AlpinDale
|
4599c98f99
feat: dynamic image size support for VLMs
|
5 months ago |
AlpinDale
|
5be90c3859
Mamba infrastrucuture support (#586)
|
5 months ago |
AlpinDale
|
ae04f57ec1
feat: Pipeline Parallel support (#581)
|
5 months ago |
AlpinDale
|
cdff8e89f9
feat: introduce `DraftModelRunner`
|
5 months ago |
AlpinDale
|
c0c336aaa3
refactor: registry for processing model inputs; quick_gelu; clip model support
|
5 months ago |
AlpinDale
|
426a13ab73
fix: pass multi_modal_kwargs to CPU model runner
|
5 months ago |
AlpinDale
|
405bb74612
Control plane comms refactor (#573)
|
5 months ago |
AlpinDale
|
fdabb55a4d
fix: wrong multi_modal_input format for CPU
|
5 months ago |
AlpinDale
|
8d77c69cbd
feat: support image processor and add llava example
|
5 months ago |
AlpinDale
|
f6250c5516
move dockerfiles to root; fix cpu build
|
6 months ago |
AlpinDale
|
a94de94c44
refactor: combine the prefill and decode into a single API (#553)
|
6 months ago |
AlpinDale
|
50b7c13db0
refactor: attention selector (#552)
|
6 months ago |
AlpinDale
|
0e062e66d3
set block size at init
|
6 months ago |
AlpinDale
|
35ae01d7ba
refactor: attention metadata term
|
6 months ago |
AlpinDale
|
aed64884c6
feat: prompt logprobs with chunked prefill (#539)
|
6 months ago |
AlpinDale
|
fca911ee0a
vLLM Upstream Sync (#526)
|
6 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |