Tri Dao b16d814c62 Revert to before Cutlass 3.6.0 update to investigate perf issue 1 week ago
..
composable_kernel @ a9b170b541 e2182cc21d Support page kvcache in AMD ROCm (#1198) 3 months ago
cutlass @ e1cd8c7866 b16d814c62 Revert to before Cutlass 3.6.0 update to investigate perf issue 1 week ago
flash_attn 83e41b3ca4 Add custom ops for compatibility with PT Compile (#1139) 3 months ago
flash_attn_ck 53a4f34163 Hotfix due to change of upstream api (#1239) 3 months ago
ft_attention 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
fused_dense_lib 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
fused_softmax 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
layer_norm 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
rotary 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
xentropy 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago