Commit History

Author     SHA1        Message                                                                Date
AlpinDale  8d77c69cbd  feat: support image processor and add llava example                    7 months ago
AlpinDale  e4ea3da1ad  fix: tensor parallel with embedding model                              7 months ago
AlpinDale  de62ceb18c  refactor: eliminate parallel worker per-step task scheduling overhead  7 months ago
AlpinDale  a94de94c44  refactor: combine the prefill and decode into a single API (#553)      7 months ago
AlpinDale  50b7c13db0  refactor: attention selector (#552)                                    8 months ago
AlpinDale  be8154a8a0  feat: proper embeddings API with e5-mistral-7b support                 8 months ago