AlpinDale a2d476183f fix: remove scipy and re-implement CSR matrix 6 kuukautta sitten
..
backends 2105e4fd6b feat: correctly invoke prefill & decode kernels for cross-attention 6 kuukautta sitten
ops a2d476183f fix: remove scipy and re-implement CSR matrix 6 kuukautta sitten
__init__.py a94de94c44 refactor: combine the prefill and decode into a single API (#553) 7 kuukautta sitten
layer.py 2105e4fd6b feat: correctly invoke prefill & decode kernels for cross-attention 6 kuukautta sitten
selector.py b6e60143e7 Flashinfer for prefill phase (#580) 7 kuukautta sitten