Commit History

Autor SHA1 Mensaxe Data
  Tri Dao b4bf9cc1f3 Fix performance regression with causal hai 1 ano
  Tri Dao 9e5e8bc91e Change causal mask to be aligned to bottom-right instead of top-left hai 1 ano
  Tri Dao 4f285b3547 FlashAttention-2 release hai 1 ano
  Tri Dao 4360cfc6a8 [Triton] Fix benchmark_causal.py hai 1 ano
  Tri Dao 5d079fdd7a [Triton] Fix benchmark_causal, mention Triton version hai 1 ano
  Tri Dao b0c0db81f6 Implement FlashAttention in Triton %!s(int64=2) %!d(string=hai) anos
  Tri Dao ed553e9238 Add Megatron attention implementation for benchmarking %!s(int64=2) %!d(string=hai) anos
  Tri Dao 50ca23488d Add Triton implementation for benchmarking %!s(int64=2) %!d(string=hai) anos