Commit History

Author SHA1 Message Date
  AlpinDale b6c4dfce23 chore: refactor TPU model runner and worker 5 months ago
  AlpinDale 1ff6d4c3d7 feat: support pipeline parallel on indivisible GPU count (#587) 5 months ago
  AlpinDale 4f7d212b70 feat: remove vision language config 5 months ago
  AlpinDale d0ff3fd59e fix: tpu sampler output 6 months ago
  AlpinDale d2461161ec chore: optimize KV cache swapping for TPU 6 months ago
  AlpinDale 8b626e4032 fix: cpu kv cache allocation for TPU 6 months ago
  AlpinDale fcd58614f4 feat: support parallel sampling and swapping in TPU 6 months ago
  AlpinDale af1286f9fa fix: kv cache size calculation on TPUs 6 months ago
  AlpinDale 608e8e1310 chore: refactor TPU backend to make it more similar to GPU backend 6 months ago
  AlpinDale a524667db0 fix: device assertion for sdpa backend; fix env for tpu worker 6 months ago
  AlpinDale fe21123a1c feat: TPU support (#570) 6 months ago