Commit History

Auteur SHA1 Bericht Datum
  AlpinDale d9d287a288 rocm: enable multi-step scheduling for rocm (#1071) 1 maand geleden
  AlpinDale 61aed092a5 rocm: add support for FP8 KV cache in the custom paged attention kkernels (#1066) 1 maand geleden
  AlpinDale 4a7cb8f232 rocm: add custom paged attention kernels for ROCm (#1043) 1 maand geleden
  AlpinDale 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 1 maand geleden
  AlpinDale 901900854e chore: consolidate environment variables within one file (#882) 2 maanden geleden
  AlpinDale 1405051912 attention: add `AttentionState` abstraction (#863) 2 maanden geleden
  AlpinDale e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) 5 maanden geleden
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) 5 maanden geleden
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 9 maanden geleden