.. |
backends
|
ba2c3fc88d
feat: add Tencent Hunyuan model support
|
2 months ago |
ops
|
e200775863
feat: enable using fp8 kv and prefix caching with chunked prefill (#668)
|
4 months ago |
__init__.py
|
a0e446a17d
feat: initial encoder-decoder support with BART model (#633)
|
4 months ago |
layer.py
|
bf88c8567e
feat: mamba model support (#674)
|
4 months ago |
selector.py
|
5d37ec1016
suppress tpu import warning (#696)
|
4 months ago |