AlpinDale 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) vor 4 Monaten
..
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) vor 8 Monaten
abstract.py 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) vor 4 Monaten
blocksparse_attn.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
flash_attn.py 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) vor 4 Monaten
flashinfer.py 60b702a827 chore: register custom torch ops for flash-attn and flashinfer (#724) vor 4 Monaten
ipex_attn.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
openvino.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
pallas.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
placeholder_attn.py bf88c8567e feat: mamba model support (#674) vor 4 Monaten
rocm_flash_attn.py e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) vor 4 Monaten
torch_sdpa.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
utils.py 3bbb3f2086 feat: add numpy implementation of `compute_slot_mapping` (#678) vor 4 Monaten
xformers.py e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) vor 4 Monaten