Commit History

Author     SHA1        Message                                                     Date
AlpinDale  50b7c13db0  refactor: attention selector (#552)                         8 months ago
AlpinDale  0e062e66d3  set block size at init                                      8 months ago
AlpinDale  21ce19b3ea  blocks_to_copy dict -> torch.Tensor                         8 months ago
AlpinDale  ef733aee43  implement ExecuteModelData to reduce executor complexity    8 months ago
AlpinDale  46159b107a  formatting: pt1                                             8 months ago
AlpinDale  fca911ee0a  vLLM Upstream Sync (#526)                                   8 months ago
AlpinDale  f894f7b176  Revert "reduce dedupe by wrapping in general worker class"  10 months ago
AlpinDale  9fff6fb507  reduce dedupe by wrapping in general worker class           10 months ago
AlpinDale  9d81716bfd  [v0.5.3] Release Candidate (#388)                           10 months ago