Tri Dao
|
b4bf9cc1f3
Fix performance regression with causal
|
1 éve |
Tri Dao
|
9e5e8bc91e
Change causal mask to be aligned to bottom-right instead of top-left
|
1 éve |
Tri Dao
|
4f285b3547
FlashAttention-2 release
|
1 éve |
Tri Dao
|
4360cfc6a8
[Triton] Fix benchmark_causal.py
|
1 éve |
Tri Dao
|
5d079fdd7a
[Triton] Fix benchmark_causal, mention Triton version
|
1 éve |
Tri Dao
|
b0c0db81f6
Implement FlashAttention in Triton
|
2 éve |
Tri Dao
|
ed553e9238
Add Megatron attention implementation for benchmarking
|
2 éve |
Tri Dao
|
50ca23488d
Add Triton implementation for benchmarking
|
2 éve |