.. |
attention
|
05d0a7e763
feat: adapt the attention kernels
|
1 year ago |
activation.cpp
|
28866137ea
feat: add swiglu activation
|
1 year ago |
activation_kernels.cu
|
28866137ea
feat: add swiglu activation
|
1 year ago |
attention.cpp
|
d40a8d6bb0
chore: bind single_query_cached_kv_attention to python
|
1 year ago |
cache.cpp
|
a409431c40
feat: draft for cuda kernels
|
1 year ago |
cache_kernels.cu
|
a409431c40
feat: draft for cuda kernels
|
1 year ago |