AlpinDale
|
8d77c69cbd
feat: support image processor and add llava example
|
7 months ago |
AlpinDale
|
e4ea3da1ad
fix: tensor parallel with embedding model
|
7 months ago |
AlpinDale
|
de62ceb18c
refactor: eliminate parallel worker per-step task scheduling overhead
|
7 months ago |
AlpinDale
|
a94de94c44
refactor: combine the prefill and decode into a single API (#553)
|
7 months ago |
AlpinDale
|
50b7c13db0
refactor: attention selector (#552)
|
8 months ago |
AlpinDale
|
be8154a8a0
feat: proper embeddings API with e5-mistral-7b support
|
8 months ago |