Commit History

Author SHA1 Message Date
  AlpinDale b6c4dfce23 chore: refactor TPU model runner and worker 5 months ago
  AlpinDale 1ff6d4c3d7 feat: support pipeline parallel on indivisible GPU count (#587) 5 months ago
  AlpinDale 4f7d212b70 feat: remove vision language config 5 months ago
  AlpinDale d0ff3fd59e fix: tpu sampler output 5 months ago
  AlpinDale d2461161ec chore: optimize KV cache swapping for TPU 5 months ago
  AlpinDale 8b626e4032 fix: cpu kv cache allocation for TPU 5 months ago
  AlpinDale fcd58614f4 feat: support parallel sampling and swapping in TPU 5 months ago
  AlpinDale af1286f9fa fix: kv cache size calculation on TPUs 5 months ago
  AlpinDale 608e8e1310 chore: refactor TPU backend to make it more similar to GPU backend 5 months ago
  AlpinDale a524667db0 fix: device assertion for sdpa backend; fix env for tpu worker 5 months ago
  AlpinDale fe21123a1c feat: TPU support (#570) 5 months ago