AlpinDale
|
d9d287a288
rocm: enable multi-step scheduling for rocm (#1071)
|
1 month ago |
AlpinDale
|
61aed092a5
rocm: add support for FP8 KV cache in the custom paged attention kkernels (#1066)
|
1 month ago |
AlpinDale
|
4a7cb8f232
rocm: add custom paged attention kernels for ROCm (#1043)
|
1 month ago |
AlpinDale
|
22a4cd4595
core: fix spec decode metrics and envs circular import (#889)
|
1 month ago |
AlpinDale
|
901900854e
chore: consolidate environment variables within one file (#882)
|
2 months ago |
AlpinDale
|
1405051912
attention: add `AttentionState` abstraction (#863)
|
2 months ago |
AlpinDale
|
e200775863
feat: enable using fp8 kv and prefix caching with chunked prefill (#668)
|
5 months ago |
AlpinDale
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
5 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
9 months ago |