.. |
backends
|
7a313483f1
chore: move update_flash_attn_metadata to attn backend (#731)
|
3 ماه پیش |
ops
|
e200775863
feat: enable using fp8 kv and prefix caching with chunked prefill (#668)
|
4 ماه پیش |
__init__.py
|
a0e446a17d
feat: initial encoder-decoder support with BART model (#633)
|
4 ماه پیش |
layer.py
|
bf88c8567e
feat: mamba model support (#674)
|
4 ماه پیش |
selector.py
|
5d37ec1016
suppress tpu import warning (#696)
|
4 ماه پیش |