AlpinDale 083ba7b452 roll back chunked prefill changes to SDPA, isolate cpu worker vor 9 Monaten
..
__init__.py 0f1399c135 feat: attention refactor part 2 vor 9 Monaten
abstract.py fe17712f29 fully working chunked prefill vor 9 Monaten
flash_attn.py fe17712f29 fully working chunked prefill vor 9 Monaten
rocm_flash_attn.py fe17712f29 fully working chunked prefill vor 9 Monaten
sdpa.py 083ba7b452 roll back chunked prefill changes to SDPA, isolate cpu worker vor 9 Monaten
xformers.py fe17712f29 fully working chunked prefill vor 9 Monaten