Commit History

Author SHA1 Message Date
  Tri Dao 6807b1ea37 Longest-processing-time-first scheduler for causal 1 day ago
  Tri Dao 6293008748 Add option for Mma0_is_RS and Mma1_is_RS in attn fwd 1 week ago
  Tri Dao 2c996ca25f Use SeqlenInfo for bwd and epilogue 1 week ago
  Tri Dao 9c954f7021 Use num_split_heuristics in fwd and fwd_varlen 1 week ago
  Tri Dao f6e165becf Change tile_size and local to avoid wgmma being serialized 1 week ago
  Tri Dao 94657af3e8 Add option for not doing intra-WG overlapping of gemm and softmax 3 weeks ago
  Tri Dao fc2fd95a18 Renable FP8 kernels 3 weeks ago
  Tri Dao 586ba914bb Move fwd tile size to a separate file 3 weeks ago