Commit History

Author SHA1 Message Date
  Ying Zhang cdbbe844b1 minor changes to unpad_input test util func 4 months ago
  Tri Dao cc1690d9d6 [Rotary] Add test for rotary when qkv are packed an there's GQA 4 months ago
  Ivan Komarov f692b98d80 Fix spurious re-compilations of `rotary_kernel` (#911) 9 months ago
  Tri Dao b28ec236df [Rotary] Implement varlen rotary 1 year ago
  Tri Dao 1c523c1ce1 [Rotary] Speed up rotary kernel when interleaved=True 1 year ago
  Tri Dao 942fcbf046 [Rotary] Implement rotary in Triton 1 year ago
  Tri Dao 0e8c46ae08 Run isort and black on test files 1 year ago
  Tri Dao d4b320b31f Add MLP, MHA, Block, Embedding modules 2 years ago
  Tri Dao ca81f32e04 Implement rotary embedding in CUDA 2 years ago