提交历史

作者 SHA1 备注 提交日期
  Tri Dao a84a237d2a Split bwd softcap compilation units for Sm80 1 月之前
  Tri Dao 180ff782dd Template for Sm86 1 月之前
  Tri Dao 7bc3f031a4 Compile for both Sm80 and Sm90 1 月之前
  Tri Dao 7a802796e1 Big refactor and update 1 月之前
  Ying Zhang 1c9717d699 address comments 4 月之前
  Ying Zhang dff976a84a fixes 5 月之前
  Ying Zhang 7b4e68e04f hopper local attention 5 月之前
  Ying Zhang db80387343 Add seqused_q in fwd / bwd and seqused_k in bwd. 5 月之前
  Cameron Shinn 3cea2fb6ee Add ArchTag to pre/postprocess bwd kernels (#1180) 5 月之前
  Tri Dao bafe253042 [FA3] Bwd 6 月之前
  Cameron Shinn cb516f855b Remove torchlib dependency from cpp files (#1083) 6 月之前
  Tri Dao 7f67966cc7 FA3 initial code release 7 月之前