AlpinDale
|
50b7c13db0
refactor: attention selector (#552)
|
8 月之前 |
AlpinDale
|
0e062e66d3
set block size at init
|
8 月之前 |
AlpinDale
|
21ce19b3ea
blocks_to_copy dict -> torch.Tensor
|
8 月之前 |
AlpinDale
|
ef733aee43
implement ExecuteModelData to reduce executor complexity
|
8 月之前 |
AlpinDale
|
46159b107a
formatting: pt1
|
8 月之前 |
AlpinDale
|
fca911ee0a
vLLM Upstream Sync (#526)
|
8 月之前 |
AlpinDale
|
f894f7b176
Revert "reduce dedupe by wrapping in general worker class"
|
10 月之前 |
AlpinDale
|
9fff6fb507
reduce dedupe by wrapping in general worker class
|
10 月之前 |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
10 月之前 |