Commit History

Auteur SHA1 Bericht Datum
  Tri Dao 8c20cfef49 [Rotary] Support qkv block layout from GQA 3 maanden geleden
  Ivan Komarov f692b98d80 Fix spurious re-compilations of `rotary_kernel` (#911) 8 maanden geleden
  Tri Dao 8a733cbd53 [Gen] Fix calling update_graph_cache in tests 1 jaar geleden
  Tri Dao 9795159082 [Rotary] Set device before launching Triton kernel to avoid error 1 jaar geleden
  Tri Dao b28ec236df [Rotary] Implement varlen rotary 1 jaar geleden
  Tri Dao 861c82577d [Rotary] Clean up rotary Triton implementation a bit 1 jaar geleden
  Tri Dao 1c523c1ce1 [Rotary] Speed up rotary kernel when interleaved=True 1 jaar geleden
  Tri Dao 942fcbf046 [Rotary] Implement rotary in Triton 1 jaar geleden