..
backends | 2105e4fd6b | feat: correctly invoke prefill & decode kernels for cross-attention | 6 months ago
ops | a2d476183f | fix: remove scipy and re-implement CSR matrix | 6 months ago
__init__.py | a94de94c44 | refactor: combine the prefill and decode into a single API (#553) | 7 months ago
layer.py | 2105e4fd6b | feat: correctly invoke prefill & decode kernels for cross-attention | 6 months ago
selector.py | b6e60143e7 | Flashinfer for prefill phase (#580) | 7 months ago