AlpinDale
|
5289c14b24
feat: Asymmetric Tensor Parallel (#594)
|
4 months ago |
AlpinDale
|
99680b2d23
feat: soft prompts (#589)
|
4 months ago |
AlpinDale
|
4f7d212b70
feat: remove vision language config
|
4 months ago |
AlpinDale
|
4599c98f99
feat: dynamic image size support for VLMs
|
4 months ago |
AlpinDale
|
5be90c3859
Mamba infrastructure support (#586)
|
4 months ago |
AlpinDale
|
ae04f57ec1
feat: Pipeline Parallel support (#581)
|
4 months ago |
AlpinDale
|
cdff8e89f9
feat: introduce `DraftModelRunner`
|
5 months ago |
AlpinDale
|
405bb74612
Control plane comms refactor (#573)
|
5 months ago |
AlpinDale
|
8d77c69cbd
feat: support image processor and add llava example
|
5 months ago |
AlpinDale
|
e4ea3da1ad
fix: tensor parallel with embedding model
|
5 months ago |
AlpinDale
|
de62ceb18c
refactor: eliminate parallel worker per-step task scheduling overhead
|
5 months ago |
AlpinDale
|
a94de94c44
refactor: combine the prefill and decode into a single API (#553)
|
5 months ago |
AlpinDale
|
50b7c13db0
refactor: attention selector (#552)
|
5 months ago |
AlpinDale
|
be8154a8a0
feat: proper embeddings API with e5-mistral-7b support
|
5 months ago |