Commit History

Author SHA1 Message Date
  AlpinDale d0ff3fd59e fix: tpu sampler output 7 months ago
  AlpinDale d2461161ec chore: optimize KV cache swapping for TPU 7 months ago
  AlpinDale 8b626e4032 fix: cpu kv cache allocation for TPU 7 months ago
  AlpinDale fcd58614f4 feat: support parallel sampling and swapping in TPU 7 months ago
  AlpinDale af1286f9fa fix: kv cache size calculation on TPUs 7 months ago
  AlpinDale 608e8e1310 chore: refactor TPU backend to make it more similar to GPU backend 7 months ago
  AlpinDale a524667db0 fix: device assertion for sdpa backend; fix env for tpu worker 7 months ago
  AlpinDale fe21123a1c feat: TPU support (#570) 7 months ago