Tri Dao
|
5400fdc4ac
[CE] Implement CrossEntropyLoss in Triton
|
1 year ago |
Tri Dao
|
43ab0b5205
Mention that some CUDA extensions have only been tested on A100s
|
2 years ago |
Tri Dao
|
fa6d1ce44f
Add fused_dense and dropout_add_layernorm CUDA extensions
|
2 years ago |