AlpinDale bf88c8567e feat: mamba model support (#674) hace 6 meses
..
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) hace 10 meses
abstract.py f1d0b77c92 [0.6.0] Release Candidate (#481) hace 6 meses
blocksparse_attn.py f1d0b77c92 [0.6.0] Release Candidate (#481) hace 6 meses
flash_attn.py 7df7b8ca53 optimization: reduce end-to-end overhead from python obj allocation (#666) hace 6 meses
flashinfer.py 67ee885293 fix: flashinfer outputs (#657) hace 6 meses
ipex_attn.py f1d0b77c92 [0.6.0] Release Candidate (#481) hace 6 meses
openvino.py f1d0b77c92 [0.6.0] Release Candidate (#481) hace 6 meses
pallas.py f1d0b77c92 [0.6.0] Release Candidate (#481) hace 6 meses
placeholder_attn.py bf88c8567e feat: mamba model support (#674) hace 6 meses
rocm_flash_attn.py e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) hace 6 meses
torch_sdpa.py f1d0b77c92 [0.6.0] Release Candidate (#481) hace 6 meses
utils.py 7df7b8ca53 optimization: reduce end-to-end overhead from python obj allocation (#666) hace 6 meses
xformers.py e200775863 feat: enable using fp8 kv and prefix caching with chunked prefill (#668) hace 6 meses