Tri Dao
|
88173a1aaf
[FusedDense] Support relu, rename FusedDenseGeluDense -> FusedMLP
|
пре 1 година |
Tri Dao
|
43798966cf
[Docs] Fix formatting
|
пре 1 година |
Tri Dao
|
3c7cbfc195
[Docs] Mention that dropout_layer_norm supports all dims up to 6k
|
пре 1 година |
Tri Dao
|
4a6eaa9f27
Update configs, add results
|
пре 2 година |
Tri Dao
|
0bf5e50038
Release training code
|
пре 2 година |