Commit History

Author SHA1 Message Date
  Tri Dao 8c20cfef49 [Rotary] Support qkv block layout from GQA 3 months ago
  Ivan Komarov f692b98d80 Fix spurious re-compilations of `rotary_kernel` (#911) 8 months ago
  Tri Dao 8a733cbd53 [Gen] Fix calling update_graph_cache in tests 1 year ago
  Tri Dao 9795159082 [Rotary] Set device before launching Triton kernel to avoid error 1 year ago
  Tri Dao b28ec236df [Rotary] Implement varlen rotary 1 year ago
  Tri Dao 861c82577d [Rotary] Clean up rotary Triton implementation a bit 1 year ago
  Tri Dao 1c523c1ce1 [Rotary] Speed up rotary kernel when interleaved=True 1 year ago
  Tri Dao 942fcbf046 [Rotary] Implement rotary in Triton 1 year ago