Commit History

Author SHA1 Message Date
  Tri Dao 2c996ca25f Use SeqlenInfo for bwd and epilogue 1 week ago
  Tri Dao 29cdfedd80 Use Bulk reduce instead of TMA for dQaccum, split across WGs 1 week ago
  Tri Dao 82dc825759 Don't use the unsafe convert_type function 2 weeks ago
  Tri Dao df96486c31 Decode: varlen, paged KV, leftpad 1 month ago
  Tri Dao 6e8b25e426 Refactor 2 months ago
  Ying Zhang db80387343 Add seqused_q in fwd / bwd and seqused_k in bwd. 3 months ago
  Cameron Shinn 3cea2fb6ee Add ArchTag to pre/postprocess bwd kernels (#1180) 3 months ago
  Tri Dao bafe253042 [FA3] Bwd 4 months ago
  Tri Dao 7f67966cc7 FA3 initial code release 5 months ago