AlpinDale
|
4599c98f99
feat: dynamic image size support for VLMs
|
7 months ago |
AlpinDale
|
5be90c3859
Mamba infrastrucuture support (#586)
|
7 months ago |
AlpinDale
|
ae04f57ec1
feat: Pipeline Parallel support (#581)
|
7 months ago |
AlpinDale
|
cdff8e89f9
feat: introduce `DraftModelRunner`
|
7 months ago |
AlpinDale
|
405bb74612
Control plane comms refactor (#573)
|
7 months ago |
AlpinDale
|
8d77c69cbd
feat: support image processor and add llava example
|
7 months ago |
AlpinDale
|
e4ea3da1ad
fix: tensor parallel with embedding model
|
7 months ago |
AlpinDale
|
de62ceb18c
refactor: eliminate parallel worker per-step task scheduling overhead
|
8 months ago |
AlpinDale
|
a94de94c44
refactor: combine the prefill and decode into a single API (#553)
|
8 months ago |
AlpinDale
|
50b7c13db0
refactor: attention selector (#552)
|
8 months ago |
AlpinDale
|
be8154a8a0
feat: proper embeddings API with e5-mistral-7b support
|
8 months ago |