Commit History

Autor SHA1 Mensaxe Data
  Tri Dao 7bc3f031a4 Compile for both Sm80 and Sm90 hai 2 semanas
  Tri Dao 7a802796e1 Big refactor and update hai 2 semanas
  Ying Zhang 1c9717d699 address comments hai 4 meses
  Ying Zhang dff976a84a fixes hai 4 meses
  Ying Zhang 7b4e68e04f hopper local attention hai 4 meses
  Ying Zhang db80387343 Add seqused_q in fwd / bwd and seqused_k in bwd. hai 5 meses
  Cameron Shinn 3cea2fb6ee Add ArchTag to pre/postprocess bwd kernels (#1180) hai 5 meses
  Tri Dao bafe253042 [FA3] Bwd hai 5 meses
  Cameron Shinn cb516f855b Remove torchlib dependency from cpp files (#1083) hai 6 meses
  Tri Dao 7f67966cc7 FA3 initial code release hai 6 meses