AlpinDale
|
b6c4dfce23
chore: refactor TPU model runner and worker
|
4 月之前 |
AlpinDale
|
1ff6d4c3d7
feat: support pipeline parallel on indivisible GPU count (#587)
|
4 月之前 |
AlpinDale
|
4f7d212b70
feat: remove vision language config
|
4 月之前 |
AlpinDale
|
d0ff3fd59e
fix: tpu sampler output
|
5 月之前 |
AlpinDale
|
d2461161ec
chore: optimize KV cache swapping for TPU
|
5 月之前 |
AlpinDale
|
8b626e4032
fix: cpu kv cache allocation for TPU
|
5 月之前 |
AlpinDale
|
fcd58614f4
feat: support parallel sampling and swapping in TPU
|
5 月之前 |
AlpinDale
|
af1286f9fa
fix: kv cache size calculation on TPUs
|
5 月之前 |
AlpinDale
|
608e8e1310
chore: refactor TPU backend to make it more similar to GPU backend
|
5 月之前 |
AlpinDale
|
a524667db0
fix: device assertion for sdpa backend; fix env for tpu worker
|
5 月之前 |
AlpinDale
|
fe21123a1c
feat: TPU support (#570)
|
5 月之前 |