Tri Dao | 94657af3e8 | Add option for not doing intra-WG overlapping of gemm and softmax | 3 weeks ago
Tri Dao | fe412d6b36 | Redo rotary when contiguous | 3 weeks ago
Tri Dao | b2d3fe92ff | Move rotary to a separate file | 4 weeks ago
Tri Dao | 4d00645c76 | Implement appending new KV to KV cache | 1 month ago
Tri Dao | d00b88ee05 | Move PagedKV to a separate file | 1 month ago