Tri Dao
|
fc2fd95a18
Renable FP8 kernels
|
3 weeks ago |
Tri Dao
|
64d92bce53
Split PagedKV into separate .cu files to speed up compilation
|
3 weeks ago |
Tri Dao
|
586ba914bb
Move fwd tile size to a separate file
|
3 weeks ago |
Tri Dao
|
018b9af683
Move .cu files to instantiations, use generate_kernels.py
|
3 weeks ago |