AlpinDale d9d287a288 rocm: enable multi-step scheduling for rocm (#1071) пре 5 дана
..
backends d9d287a288 rocm: enable multi-step scheduling for rocm (#1071) пре 5 дана
ops e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) пре 4 месеци
__init__.py 1405051912 attention: add `AttentionState` abstraction (#863) пре 1 месец
layer.py bf88c8567e feat: mamba model support (#674) пре 4 месеци
selector.py 4ddc14d653 core: use flashinfer for FP8 KV when available (#944) пре 2 недеља