Tri Dao
|
a901c7eeda
Make Sm80 forward pass work with persistent scheduler
|
hace 1 mes |
Tri Dao
|
ba2061dfe8
Support cu_seqlens_k_new in flash_attn_with_kvcache
|
hace 1 mes |
Tri Dao
|
6807b1ea37
Longest-processing-time-first scheduler for causal
|
hace 1 mes |
Tri Dao
|
df96486c31
Decode: varlen, paged KV, leftpad
|
hace 2 meses |
Tri Dao
|
6e8b25e426
Refactor
|
hace 4 meses |
Ying Zhang
|
dff976a84a
fixes
|
hace 5 meses |
jayhshah
|
5018ac6ac5
Fp8 kernel with "in-kernel" transpose of V in producer (#1100)
|
hace 6 meses |
Tri Dao
|
74b0761ff7
[FA3] BF16 forward
|
hace 6 meses |
Tri Dao
|
7f67966cc7
FA3 initial code release
|
hace 6 meses |