AlpinDale
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
AlpinDale
|
4e71bd1d12
feat: add PagedAttention V2 kernels (#76)
|
1 year ago |
AlpinDale
|
23389d0108
zero out a variable instead of vector in kernels
|
1 year ago |
AlpinDale
|
081545bde6
fix: various CUDA kernel tweaks
|
1 year ago |
AlpinDale
|
6aa1a9ee79
feat: add bf16 datatype headers
|
1 year ago |