| File | Commit | Message | Updated |
|---|---|---|---|
| `__init__.py` | 9d81716bfd | [v0.5.3] Release Candidate (#388) | 8 months ago |
| `abstract.py` | 4d4e767838 | ci: take one of fixing lint issues | 4 months ago |
| `blocksparse_attn.py` | 614ca6b0bf | feat: support logits soft capping with flash attention backend | 4 months ago |
| `flash_attn.py` | 6c1eab6a6c | feat: non-blocking transfer in prepare_input | 4 months ago |
| `flashinfer.py` | 6c1eab6a6c | feat: non-blocking transfer in prepare_input | 4 months ago |
| `ipex_attn.py` | 614ca6b0bf | feat: support logits soft capping with flash attention backend | 4 months ago |
| `openvino.py` | 0886c361f4 | feat: OpenVINO CPU backend (#576) | 5 months ago |
| `pallas.py` | 614ca6b0bf | feat: support logits soft capping with flash attention backend | 4 months ago |
| `rocm_flash_attn.py` | 4d4e767838 | ci: take one of fixing lint issues | 4 months ago |
| `torch_sdpa.py` | 614ca6b0bf | feat: support logits soft capping with flash attention backend | 4 months ago |
| `utils.py` | 0e6c400b13 | feat: re-add GGUF (#600) | 4 months ago |
| `xformers.py` | 614ca6b0bf | feat: support logits soft capping with flash attention backend | 4 months ago |