AlpinDale 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) il y a 3 mois
..
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) il y a 8 mois
abstract.py 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) il y a 3 mois
blocksparse_attn.py f1d0b77c92 [0.6.0] Release Candidate (#481) il y a 4 mois
flash_attn.py 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) il y a 3 mois
flashinfer.py 60b702a827 chore: register custom torch ops for flash-attn and flashinfer (#724) il y a 3 mois
ipex_attn.py f1d0b77c92 [0.6.0] Release Candidate (#481) il y a 4 mois
openvino.py f1d0b77c92 [0.6.0] Release Candidate (#481) il y a 4 mois
pallas.py f1d0b77c92 [0.6.0] Release Candidate (#481) il y a 4 mois
placeholder_attn.py bf88c8567e feat: mamba model support (#674) il y a 4 mois
rocm_flash_attn.py e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) il y a 4 mois
torch_sdpa.py f1d0b77c92 [0.6.0] Release Candidate (#481) il y a 4 mois
utils.py 3bbb3f2086 feat: add numpy implementation of `compute_slot_mapping` (#678) il y a 4 mois
xformers.py e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) il y a 4 mois