| File | Commit | Message | Last updated |
|---|---|---|---|
| __init__.py | 9d81716bfd | [v0.5.3] Release Candidate (#388) | 8 months ago |
| abstract.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago |
| blocksparse_attn.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago |
| flash_attn.py | 24456206a9 | fix: logit softcapping in flash-attn (#688) | 4 months ago |
| flashinfer.py | 300f889554 | chore: update flashinfer to v0.1.3 (#685) | 4 months ago |
| ipex_attn.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago |
| openvino.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago |
| pallas.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago |
| placeholder_attn.py | bf88c8567e | feat: mamba model support (#674) | 4 months ago |
| rocm_flash_attn.py | e200775863 | feat: enable using fp8 kv and prefix caching with chunked prefill (#668) | 4 months ago |
| torch_sdpa.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago |
| utils.py | 3bbb3f2086 | feat: add numpy implementation of `compute_slot_mapping` (#678) | 4 months ago |
| xformers.py | e200775863 | feat: enable using fp8 kv and prefix caching with chunked prefill (#668) | 4 months ago |