Tri Dao
|
6807b1ea37
Longest-processing-time-first scheduler for causal
|
1 dag sedan |
Tri Dao
|
6293008748
Add option for Mma0_is_RS and Mma1_is_RS in attn fwd
|
1 vecka sedan |
Tri Dao
|
2c996ca25f
Use SeqlenInfo for bwd and epilogue
|
1 vecka sedan |
Tri Dao
|
9c954f7021
Use num_split_heuristics in fwd and fwd_varlen
|
1 vecka sedan |
Tri Dao
|
f6e165becf
Change tile_size and local to avoid wgmma being serialized
|
1 vecka sedan |
Tri Dao
|
94657af3e8
Add option for not doing intra-WG overlapping of gemm and softmax
|
3 veckor sedan |
Tri Dao
|
fc2fd95a18
Renable FP8 kernels
|
3 veckor sedan |
Tri Dao
|
586ba914bb
Move fwd tile size to a separate file
|
3 veckor sedan |