AlpinDale 032974a28a tpu: fix TPU type api (#975) há 2 semanas atrás
..
backends 032974a28a tpu: fix TPU type api (#975) há 2 semanas atrás
ops e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) há 4 meses atrás
__init__.py 1405051912 attention: add `AttentionState` abstraction (#863) há 1 mês atrás
layer.py bf88c8567e feat: mamba model support (#674) há 4 meses atrás
selector.py 4ddc14d653 core: use flashinfer for FP8 KV when available (#944) há 2 semanas atrás