AlpinDale 083ba7b452 roll back chunked prefill changes to SDPA, isolate cpu worker vor 10 Monaten
..
__init__.py 0f1399c135 feat: attention refactor part 2 vor 10 Monaten
abstract.py fe17712f29 fully working chunked prefill vor 10 Monaten
flash_attn.py fe17712f29 fully working chunked prefill vor 10 Monaten
rocm_flash_attn.py fe17712f29 fully working chunked prefill vor 10 Monaten
sdpa.py 083ba7b452 roll back chunked prefill changes to SDPA, isolate cpu worker vor 10 Monaten
xformers.py fe17712f29 fully working chunked prefill vor 10 Monaten