Historique des commits

Auteur SHA1 Message Date
  Tri Dao abbc131173 [LayerNorm] Switch from CUDA to Triton implementation il y a 11 mois
  Tri Dao 393882bc08 [LayerNorm] Implement LN with parallel residual, support dim 8k il y a 1 an
  Tri Dao 6738d9477d [LayerNorm] Implement RMS Norm il y a 1 an
  Tri Dao 8c6609ae1a [LayerNorm] Support all dimensions up to 6k (if divisible by 8) il y a 2 ans
  Tri Dao 0bf5e50038 Release training code il y a 2 ans
  Tri Dao 43ab0b5205 Mention that some CUDA extensions have only been tested on A100s il y a 2 ans
  Tri Dao 2e33fc8e36 Add GPT and ViT models il y a 2 ans
  Tri Dao fa6d1ce44f Add fused_dense and dropout_add_layernorm CUDA extensions il y a 2 ans