AlpinDale
|
4d4e767838
ci: take one of fixing lint issues
|
5 months ago |
AlpinDale
|
6c2e24de53
fix: support flashinfer for draft model runner
|
5 months ago |
AlpinDale
|
b15e6376f8
bump to torch 2.4.0, add aphrodite_flash_attn (#614)
|
5 months ago |
AlpinDale
|
705e50f4bd
fix: broadcasting logic for multi_modal_kwargs
|
5 months ago |
AlpinDale
|
a4cbcfe59f
feat: disable logprob serialization to CPU for spec decode
|
5 months ago |
AlpinDale
|
fa15bad2ea
chore: minor AMD fixes
|
5 months ago |
AlpinDale
|
8ee8483fcf
`enable_gpu_advance_step` -> `allow_gpu_advance_step`
|
5 months ago |
AlpinDale
|
dd18c5042c
move prepare_inputs to the GPU (#596)
|
5 months ago |
AlpinDale
|
5289c14b24
feat: Asymmetric Tensor Parallel (#594)
|
5 months ago |
AlpinDale
|
99680b2d23
feat: soft prompts (#589)
|
5 months ago |
AlpinDale
|
4f7d212b70
feat: remove vision language config
|
5 months ago |
AlpinDale
|
5be90c3859
Mamba infrastructure support (#586)
|
5 months ago |
AlpinDale
|
ae04f57ec1
feat: Pipeline Parallel support (#581)
|
5 months ago |
AlpinDale
|
cdff8e89f9
feat: introduce `DraftModelRunner`
|
5 months ago |