Driss Guessous bc482cbf91 Add a macro for namespace (#1419) 3 weeks ago
..
composable_kernel @ 888317e698 77ad12d24e [AMD ROCm] Support variable length of page attention (#1431) 4 weeks ago
cutlass @ c506e16788 68bf390920 Update Cutlass to fix mem fence 1 month ago
flash_attn bc482cbf91 Add a macro for namespace (#1419) 3 weeks ago
flash_attn_ck 74aed78373 Replace c10::optional with std::optional in flash_attn 3 weeks ago
ft_attention 74aed78373 Replace c10::optional with std::optional in flash_attn 3 weeks ago
fused_dense_lib 74aed78373 Replace c10::optional with std::optional in flash_attn 3 weeks ago
fused_softmax 50896ec574 Make nvcc threads configurable via environment variable (#885) 11 months ago
layer_norm 74aed78373 Replace c10::optional with std::optional in flash_attn 3 weeks ago
rotary 1feb711f46 Fix compilation with clang on ARM64 (#1285) 2 months ago
xentropy 1feb711f46 Fix compilation with clang on ARM64 (#1285) 2 months ago