Commit History

Author SHA1 Message Date
  AlpinDale d9d287a288 rocm: enable multi-step scheduling for rocm (#1071) 1 month ago
  AlpinDale 61aed092a5 rocm: add support for FP8 KV cache in the custom paged attention kernels (#1066) 1 month ago
  AlpinDale 4a7cb8f232 rocm: add custom paged attention kernels for ROCm (#1043) 1 month ago
  AlpinDale 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 1 month ago
  AlpinDale 901900854e chore: consolidate environment variables within one file (#882) 2 months ago
  AlpinDale 1405051912 attention: add `AttentionState` abstraction (#863) 2 months ago
  AlpinDale e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) 5 months ago
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) 5 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 9 months ago