.. |
__init__.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
10 ヶ月 前 |
abstract.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
6 ヶ月 前 |
blocksparse_attn.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
6 ヶ月 前 |
flash_attn.py
|
7df7b8ca53
optimization: reduce end-to-end overhead from python obj allocation (#666)
|
6 ヶ月 前 |
flashinfer.py
|
67ee885293
fix: flashinfer outputs (#657)
|
6 ヶ月 前 |
ipex_attn.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
6 ヶ月 前 |
openvino.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
6 ヶ月 前 |
pallas.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
6 ヶ月 前 |
placeholder_attn.py
|
bf88c8567e
feat: mamba model support (#674)
|
6 ヶ月 前 |
rocm_flash_attn.py
|
e200775863
feat: enable using fp8 kv and prefix caching with chunked prefill (#668)
|
6 ヶ月 前 |
torch_sdpa.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
6 ヶ月 前 |
utils.py
|
7df7b8ca53
optimization: reduce end-to-end overhead from python obj allocation (#666)
|
6 ヶ月 前 |
xformers.py
|
e200775863
feat: enable using fp8 kv and prefix caching with chunked prefill (#668)
|
6 ヶ月 前 |