.. |
backends
|
083ba7b452
roll back chunked prefill changes to SDPA, isolate cpu worker
|
9 ay önce |
ops
|
1270b5567e
triton compile error for flash_attn
|
9 ay önce |
__init__.py
|
fe17712f29
fully working chunked prefill
|
9 ay önce |
layer.py
|
fe17712f29
fully working chunked prefill
|
9 ay önce |
selector.py
|
4d33ce60da
feat: Triton flash attention backend for ROCm (#407)
|
9 ay önce |