Histórico de Commits

Autor SHA1 Mensagem Data
  Antoni Viros 83e41b3ca4 Add custom ops for compatibility with PT Compile (#1139) há 3 meses atrás
  Tri Dao 65f723bb9a Split bwd into more .cu files to speed up compilation há 5 meses atrás
  Tri Dao 751c762c9c Don't specialize for hdim 224 to speed up compilation há 5 meses atrás
  Driss Guessous 1c275eb070 Fix ima for split-kv kernel (#1085) há 5 meses atrás
  Jorge António 4df62e1440 catch typo (#1058) há 5 meses atrás
  Tri Dao 40e534a7f6 Implement cache_leftpad há 6 meses atrás
  Tri Dao dca6d89da4 Don't support softcap and dropout at the same time há 6 meses atrás
  Tri Dao 908511b2b6 Split into more .cu files to speed up compilation há 6 meses atrás
  Tri Dao 1d536d7de5 Minor cleanup of softcapping há 6 meses atrás
  Nicolas Patry 8f873cc6ac Implement softcapping. (#1025) há 6 meses atrás
  Nicolas Patry 5bf201966a Fixing argument checking when using `seqlenq_ngroups_swapped`. (#976) há 6 meses atrás
  Grigory Sizov f816dee63c Support unpadded LSE layout (#970) há 6 meses atrás
  Tri Dao 9eb3d099c1 Transpose out when swapping seqlen_q and num_groups há 9 meses atrás
  Driss Guessous 4a73e903da Add in, macrosf for defining __grid_constant__ (#852) há 9 meses atrás
  Grigory Sizov 2a15840f09 Enable paged attention in varlen forward (#831) há 9 meses atrás
  Tri Dao 2406f28805 Enable headdim 256 backward on consumer GPUs (Ampere, Ada) há 10 meses atrás
  Tri Dao d9a5cb291c Fix dv = torch::empty_like(k) for mha_bwd_varlen as well há 11 meses atrás
  Brian Hirsh 2423cca3ad fix backward for when query and key have different contiguity (#818) há 11 meses atrás
  Grigory Sizov 4687936413 Fix Windows build (#816) há 11 meses atrás
  Jeremy Reizenstein 0658e320f6 Preprocessor switches to control functionality (#788) há 11 meses atrás
  Tri Dao 54e80a3829 Implement page KV cache há 11 meses atrás
  Tri Dao ea8a25ca38 Remove configure in bwd kernel launch há 11 meses atrás
  Grigory Sizov af01244ddd Add split-kv and M<->H swap to varlen forward decoding attention (#754) há 11 meses atrás
  Tri Dao 0842ec0da4 Don't dispatch to local if window size >= seqlen_k há 1 ano atrás
  Tri Dao 732654583c Implement deterministic backward (thanks to Meituan) há 1 ano atrás
  Tri Dao 5ab9b3667b Clean up alibi, implement non-causal alibi há 1 ano atrás
  Sanghun Cho e4f726fc44 Support alibi, by Sanghun Cho from Kakao Brain há 1 ano atrás
  Jeremy Reizenstein ce3e7280f8 Allow varlen_fwd to take optional seqused_k (#647) há 1 ano atrás
  Tri Dao db2f80692c Write zero to out / grad if seqlen_q or seqlen_k is zero há 1 ano atrás
  Tri Dao e279bf8ed9 [Gen] Accept cache_batch_idx to index into the KV cache há 1 ano atrás