AlpinDale 4d33ce60da feat: Triton flash attention backend for ROCm (#407) 9 mēneši atpakaļ
..
__init__.py df7ae8ce01 fix spec_decode and block imports 9 mēneši atpakaļ
batch_expansion.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
interfaces.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
metrics.py d1786645a3 fix formatting 9 mēneši atpakaļ
multi_step_worker.py 2319b411ce refactor: neuron support 9 mēneši atpakaļ
spec_decode_worker.py 4d33ce60da feat: Triton flash attention backend for ROCm (#407) 9 mēneši atpakaļ
util.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ