.. |
cutlass @ 756c351b49
|
74b0761ff7
[FA3] BF16 forward
|
5 mesiacov pred |
flash_attn
|
40e534a7f6
Implement cache_leftpad
|
5 mesiacov pred |
ft_attention
|
50896ec574
Make nvcc threads configurable via environment variable (#885)
|
9 mesiacov pred |
fused_dense_lib
|
50896ec574
Make nvcc threads configurable via environment variable (#885)
|
9 mesiacov pred |
fused_softmax
|
50896ec574
Make nvcc threads configurable via environment variable (#885)
|
9 mesiacov pred |
layer_norm
|
50896ec574
Make nvcc threads configurable via environment variable (#885)
|
9 mesiacov pred |
rotary
|
50896ec574
Make nvcc threads configurable via environment variable (#885)
|
9 mesiacov pred |
xentropy
|
50896ec574
Make nvcc threads configurable via environment variable (#885)
|
9 mesiacov pred |