Driss Guessous bc482cbf91 Add a macro for namespace (#1419) 2 weeks ago
..
composable_kernel @ 888317e698 77ad12d24e [AMD ROCm] Support variable length of page attention (#1431) 2 weeks ago
cutlass @ c506e16788 68bf390920 Update Cutlass to fix mem fence 3 weeks ago
flash_attn bc482cbf91 Add a macro for namespace (#1419) 2 weeks ago
flash_attn_ck 74aed78373 Replace c10::optional with std::optional in flash_attn 2 weeks ago
ft_attention 74aed78373 Replace c10::optional with std::optional in flash_attn 2 weeks ago
fused_dense_lib 74aed78373 Replace c10::optional with std::optional in flash_attn 2 weeks ago
fused_softmax 50896ec574 Make nvcc threads configurable via environment variable (#885) 10 months ago
layer_norm 74aed78373 Replace c10::optional with std::optional in flash_attn 2 weeks ago
rotary 1feb711f46 Fix compilation with clang on ARM64 (#1285) 1 month ago
xentropy 1feb711f46 Fix compilation with clang on ARM64 (#1285) 1 month ago