Tri Dao bedf877467 [CrossEntropy] Fix where labels address not aligned to 16 bytes 2 months ago
..
__init__.py 8e893f0950 Create __init__.py for ops/triton dir (#516) 1 year ago
cross_entropy.py bedf877467 [CrossEntropy] Fix where labels address not aligned to 16 bytes 2 months ago
k_activations.py f1a73d0740 Run isort and black on python files 1 year ago
layer_norm.py bcd918f275 [LayerNorm] Add option to write result to out and residual_out 4 months ago
linear.py 942fcbf046 [Rotary] Implement rotary in Triton 1 year ago
mlp.py f1a73d0740 Run isort and black on python files 1 year ago
rotary.py 8c20cfef49 [Rotary] Support qkv block layout from GQA 3 months ago