This website works better with JavaScript
Home
Esplora
Aiuto
Registrati
Accedi
david
/
flash-attention
mirror da
https://github.com/Dao-AILab/flash-attention
Segui
1
Vota
0
Forka
0
File
Problemi
0
Wiki
Ramo (Branch):
ipiszy/used_q
Rami (Branch)
Tag
decode
doc_masking
fa3-fp8-varlen
fa3-kvcache-gqa
ipiszy/local_attn
ipiszy/used_q
main
tdd
v2.7.4.post1
v2.7.4
v2.7.3
v2.7.2.post1
v2.7.2
v2.7.1.post4
v2.7.1.post3
v2.7.1.post2
v2.7.1.post1
v2.7.1
v2.7.0.post2
v2.7.0.post1
v2.7.0
v2.6.3
v2.6.2
v2.6.1
v2.6.0.post1
v2.6.0
v2.5.9.post1
v2.5.9
v2.5.8
v2.5.7
v2.5.6
v2.5.5
v2.5.4
v2.5.3
v2.5.2
v2.5.1.post1
v2.5.1
v2.5.0
v2.4.3.post1
v2.4.3
v2.4.2
v2.4.1
v2.4.0.post1
v2.4.0
v2.3.6
v2.3.5
v2.3.4
v2.3.3
v2.3.2
v2.3.1.post1
v2.3.1
v2.3.0
v2.2.5
v2.2.4.post1
v2.2.4
v2.2.3.post2
v2.2.3.post1
v2.2.3
v2.2.2
v2.2.1
v2.2.0
v2.1.2.post3
v2.1.2.post2
v2.1.2.post1
v2.1.2
v2.1.1
v2.1.0
v2.0.9
v2.0.8
v2.0.7
v2.0.6.post2
v2.0.6.post1
v2.0.6
v2.0.5
v2.0.4
v2.0.3
v2.0.2
v2.0.1
v2.0.0
v1.0.9
v1.0.8
v1.0.7
v1.0.6
v1.0.5
v1.0.4
v1.0.3.post0
v1.0.3
v1.0.2
v1.0.1
v1.0.0
v0.2.8
v0.2.7
v0.2.6
v0.2.5
v0.2.4
v0.2.3
v0.2.2
v0.2.1
Cronologia Commit
Cerca
Autore
SHA1
Messaggio
Data
jayhshah
c92ca63268
FA3 FP8 qkv descales + restore max offset for h128 causal + added sync for producer WG (
#1173
)
5 mesi fa
Ying Zhang
a3a257c71d
Fix out-of-bound writes for var-seq-len zero-length KVs
5 mesi fa
jayhshah
5018ac6ac5
Fp8 kernel with "in-kernel" transpose of V in producer (
#1100
)
6 mesi fa
Ying Zhang
dfe1a59e4b
Add var-seq-len to FA3 fp16 / bf16 fwd (
#1072
)
6 mesi fa
Tri Dao
74b0761ff7
[FA3] BF16 forward
6 mesi fa
Tri Dao
7f67966cc7
FA3 initial code release
6 mesi fa