AlpinDale 083ba7b452 roll back chunked prefill changes to SDPA, isolate cpu worker 9 months ago
__init__.py 0f1399c135 feat: attention refactor part 2 9 months ago
abstract.py fe17712f29 fully working chunked prefill 9 months ago
flash_attn.py fe17712f29 fully working chunked prefill 9 months ago
rocm_flash_attn.py fe17712f29 fully working chunked prefill 9 months ago
sdpa.py 083ba7b452 roll back chunked prefill changes to SDPA, isolate cpu worker 9 months ago
xformers.py fe17712f29 fully working chunked prefill 9 months ago