Tri Dao bedf877467 [CrossEntropy] Fix where labels address not aligned to 16 bytes il y a 2 mois
..
__init__.py 8e893f0950 Create __init__.py for ops/triton dir (#516) il y a 1 an
cross_entropy.py bedf877467 [CrossEntropy] Fix where labels address not aligned to 16 bytes il y a 2 mois
k_activations.py f1a73d0740 Run isort and black on python files il y a 1 an
layer_norm.py bcd918f275 [LayerNorm] Add option to write result to out and residual_out il y a 4 mois
linear.py 942fcbf046 [Rotary] Implement rotary in Triton il y a 1 an
mlp.py f1a73d0740 Run isort and black on python files il y a 1 an
rotary.py 8c20cfef49 [Rotary] Support qkv block layout from GQA il y a 3 mois