Tri Dao 68bf390920 Update Cutlass to fix mem fence 2 weeks ago
..
composable_kernel @ 13332998a4 88d1657a14 [AMD ROCm] Fix KVcache bug and improve performance (#1328) 2 months ago
cutlass @ c506e16788 68bf390920 Update Cutlass to fix mem fence 2 weeks ago
flash_attn 36dddb891c Remove unused 224 configs (#1425) 2 weeks ago
flash_attn_ck 1feb711f46 Fix compilation with clang on ARM64 (#1285) 1 month ago
ft_attention 1feb711f46 Fix compilation with clang on ARM64 (#1285) 1 month ago
fused_dense_lib 1feb711f46 Fix compilation with clang on ARM64 (#1285) 1 month ago
fused_softmax 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
layer_norm 1feb711f46 Fix compilation with clang on ARM64 (#1285) 1 month ago
rotary 1feb711f46 Fix compilation with clang on ARM64 (#1285) 1 month ago
xentropy 1feb711f46 Fix compilation with clang on ARM64 (#1285) 1 month ago