AlpinDale 4a7cb8f232 rocm: add custom paged attention kernels for ROCm (#1043) 2 months ago
..
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) 10 months ago
abstract.py 1390915778 multi-step: add support for flashinfer attention backend (#1033) 2 months ago
blocksparse_attn.py 1405051912 attention: add `AttentionState` abstraction (#863) 3 months ago
flash_attn.py 1390915778 multi-step: add support for flashinfer attention backend (#1033) 2 months ago
flashinfer.py c951a54d21 fix: multi-step + flashinfer with cuda graphs (#1036) 2 months ago
ipex_attn.py 6951928522 xpu: bump IPEX to 2.3, support GQA (#1042) 2 months ago
openvino.py 1405051912 attention: add `AttentionState` abstraction (#863) 3 months ago
pallas.py 032974a28a tpu: fix TPU type api (#975) 2 months ago
placeholder_attn.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
rocm_flash_attn.py 4a7cb8f232 rocm: add custom paged attention kernels for ROCm (#1043) 2 months ago
torch_sdpa.py 1405051912 attention: add `AttentionState` abstraction (#863) 3 months ago
utils.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
xformers.py 1405051912 attention: add `AttentionState` abstraction (#863) 3 months ago