composable_kernel @ 13332998a4
|
88d1657a14
[AMD ROCm] Fix KVcache bug and improve performance (#1328)
|
1 month ago |
cutlass @ 4c42f73fda
|
61a23ea8a2
Update to Cutlass 3.6.0
|
1 week ago |
flash_attn
|
61a23ea8a2
Update to Cutlass 3.6.0
|
1 week ago |
flash_attn_ck
|
1feb711f46
Fix compilation with clang on ARM64 (#1285)
|
1 week ago |
ft_attention
|
1feb711f46
Fix compilation with clang on ARM64 (#1285)
|
1 week ago |
fused_dense_lib
|
1feb711f46
Fix compilation with clang on ARM64 (#1285)
|
1 week ago |
fused_softmax
|
50896ec574
Make nvcc threads configurable via environment variable (#885)
|
9 months ago |
layer_norm
|
1feb711f46
Fix compilation with clang on ARM64 (#1285)
|
1 week ago |
rotary
|
1feb711f46
Fix compilation with clang on ARM64 (#1285)
|
1 week ago |
xentropy
|
1feb711f46
Fix compilation with clang on ARM64 (#1285)
|
1 week ago |