Commit History

Author SHA1 Message Date
  AlpinDale d9d287a288 rocm: enable multi-step scheduling for rocm (#1071) 1 month ago
  AlpinDale 61aed092a5 rocm: add support for FP8 KV cache in the custom paged attention kernels (#1066) 1 month ago
  AlpinDale 4a7cb8f232 rocm: add custom paged attention kernels for ROCm (#1043) 1 month ago
  AlpinDale 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 1 month ago
  AlpinDale 901900854e chore: consolidate environment variables within one file (#882) 2 months ago
  AlpinDale 1405051912 attention: add `AttentionState` abstraction (#863) 2 months ago
  AlpinDale e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) 5 months ago
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) 5 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 9 months ago