.. |
__init__.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
il y a 8 mois |
abstract.py
|
1390915778
multi-step: add support for flashinfer attention backend (#1033)
|
il y a 2 semaines |
blocksparse_attn.py
|
1405051912
attention: add `AttentionState` abstraction (#863)
|
il y a 1 mois |
flash_attn.py
|
1390915778
multi-step: add support for flashinfer attention backend (#1033)
|
il y a 2 semaines |
flashinfer.py
|
1390915778
multi-step: add support for flashinfer attention backend (#1033)
|
il y a 2 semaines |
ipex_attn.py
|
1405051912
attention: add `AttentionState` abstraction (#863)
|
il y a 1 mois |
openvino.py
|
1405051912
attention: add `AttentionState` abstraction (#863)
|
il y a 1 mois |
pallas.py
|
032974a28a
tpu: fix TPU type api (#975)
|
il y a 2 semaines |
placeholder_attn.py
|
3bb0f07461
chore: rename `task_handler` to `worker` (#985)
|
il y a 2 semaines |
rocm_flash_attn.py
|
22a4cd4595
core: fix spec decode metrics and envs circular import (#889)
|
il y a 3 semaines |
torch_sdpa.py
|
1405051912
attention: add `AttentionState` abstraction (#863)
|
il y a 1 mois |
utils.py
|
3bb0f07461
chore: rename `task_handler` to `worker` (#985)
|
il y a 2 semaines |
xformers.py
|
1405051912
attention: add `AttentionState` abstraction (#863)
|
il y a 1 mois |