Commit Verlauf

Autor SHA1 Nachricht Datum
  Tri Dao 8c20cfef49 [Rotary] Support qkv block layout from GQA vor 3 Monaten
  Ivan Komarov f692b98d80 Fix spurious re-compilations of `rotary_kernel` (#911) vor 8 Monaten
  Tri Dao 8a733cbd53 [Gen] Fix calling update_graph_cache in tests vor 1 Jahr
  Tri Dao 9795159082 [Rotary] Set device before launching Triton kernel to avoid error vor 1 Jahr
  Tri Dao b28ec236df [Rotary] Implement varlen rotary vor 1 Jahr
  Tri Dao 861c82577d [Rotary] Clean up rotary Triton implementation a bit vor 1 Jahr
  Tri Dao 1c523c1ce1 [Rotary] Speed up rotary kernel when interleaved=True vor 1 Jahr
  Tri Dao 942fcbf046 [Rotary] Implement rotary in Triton vor 1 Jahr