Tri Dao | 65f723bb9a | Split bwd into more .cu files to speed up compilation | 6 months ago
Tri Dao | 908511b2b6 | Split into more .cu files to speed up compilation | 6 months ago
Tri Dao | 7a983df742 | Use generate_kernels.py script from Driss Guessous | 1 year ago
Tri Dao | 4f285b3547 | FlashAttention-2 release | 1 year ago