Historial de Commits

Autor SHA1 Mensaje Fecha
  Tri Dao f907a13187 Tune tile sizes for fwd varlen on Sm80 and Sm86 hace 4 semanas
  Tri Dao 76f14c61c9 Tune fwd tile sizes for Sm86 and Sm89 hace 4 semanas
  Tri Dao 5171269dab Implement forward pass for Sm80 hace 1 mes
  Tri Dao 3f85126149 Use persistent scheduler when paged_kv hace 1 mes
  Tri Dao 3e5d77a102 Group instantiations for different hdims together hace 1 mes
  Tri Dao 6807b1ea37 Longest-processing-time-first scheduler for causal hace 1 mes
  Tri Dao 6293008748 Add option for Mma0_is_RS and Mma1_is_RS in attn fwd hace 1 mes
  Tri Dao 2c996ca25f Use SeqlenInfo for bwd and epilogue hace 1 mes
  Tri Dao 9c954f7021 Use num_split_heuristics in fwd and fwd_varlen hace 2 meses
  Tri Dao f6e165becf Change tile_size and local to avoid wgmma being serialized hace 2 meses
  Tri Dao 94657af3e8 Add option for not doing intra-WG overlapping of gemm and softmax hace 2 meses
  Tri Dao fc2fd95a18 Renable FP8 kernels hace 2 meses
  Tri Dao 586ba914bb Move fwd tile size to a separate file hace 2 meses