Commit History

Author SHA1 Message Date
  Tri Dao abbc131173 [LayerNorm] Switch from CUDA to Triton implementation 1 year ago
  Tri Dao f1a73d0740 Run isort and black on python files 1 year ago
  Tri Dao ada4710d70 [ViT] Run black on vit.py 1 year ago
  Tri Dao a81900d4c1 [ViT] Minor fix so it runs 1 year ago
  Tri Dao 88173a1aaf [FusedDense] Support relu, rename FusedDenseGeluDense -> FusedMLP 2 years ago
  Tri Dao 780e8eeabb [ViT] Support timm checkpoint, add tests 2 years ago
  Tri Dao ef085cfcda [ViT] Fix extra norm_0, use new LN order in Block 2 years ago
  Tri Dao 1feb94265c [ViT] Use dropout_add_ln for the 1st layer norm 2 years ago
  Tri Dao 2e33fc8e36 Add GPT and ViT models 2 years ago