Commit History

Author SHA1 Message Date
  Tri Dao abbc131173 [LayerNorm] Switch from CUDA to Triton implementation 1 year ago
  Tri Dao f1a73d0740 Run isort and black on python files 1 year ago
  Tri Dao ada4710d70 [ViT] Run black on vit.py 1 year ago
  Tri Dao a81900d4c1 [ViT] Minor fix so it runs 1 year ago
  Tri Dao 88173a1aaf [FusedDense] Support relu, rename FusedDenseGeluDense -> FusedMLP 2 years ago
  Tri Dao 780e8eeabb [ViT] Support timm checkpoint, add tests 2 years ago
  Tri Dao ef085cfcda [ViT] Fix extra norm_0, use new LN order in Block 2 years ago
  Tri Dao 1feb94265c [ViT] Use dropout_add_ln for the 1st layer norm 2 years ago
  Tri Dao 2e33fc8e36 Add GPT and ViT models 2 years ago