Tri Dao 5acb532214 Switch to cutlass v3.6.0, fix perf regression for hdim 128 causal 3 days ago
..
composable_kernel @ a9b170b541 e2182cc21d Support page kvcache in AMD ROCm (#1198) 3 months ago
cutlass @ bf9da7b76c 5acb532214 Switch to cutlass v3.6.0, fix perf regression for hdim 128 causal 3 days ago
flash_attn 83e41b3ca4 Add custom ops for compatibility with PT Compile (#1139) 3 months ago
flash_attn_ck 53a4f34163 Hotfix due to change of upstream api (#1239) 3 months ago
ft_attention 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
fused_dense_lib 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
fused_softmax 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
layer_norm 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
rotary 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
xentropy 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago