.. |
backends
|
2105e4fd6b
feat: correctly invoke prefill & decode kernels for cross-attention
|
6 kuukautta sitten |
ops
|
a2d476183f
fix: remove scipy and re-implement CSR matrix
|
6 kuukautta sitten |
__init__.py
|
a94de94c44
refactor: combine the prefill and decode into a single API (#553)
|
7 kuukautta sitten |
layer.py
|
2105e4fd6b
feat: correctly invoke prefill & decode kernels for cross-attention
|
6 kuukautta sitten |
selector.py
|
b6e60143e7
Flashinfer for prefill phase (#580)
|
7 kuukautta sitten |