Commit Verlauf

Autor SHA1 Nachricht Datum
  Tri Dao abbc131173 [LayerNorm] Switch from CUDA to Triton implementation vor 11 Monaten
  Tri Dao 393882bc08 [LayerNorm] Implement LN with parallel residual, support dim 8k vor 1 Jahr
  Tri Dao 6738d9477d [LayerNorm] Implement RMS Norm vor 1 Jahr
  Tri Dao 8c6609ae1a [LayerNorm] Support all dimensions up to 6k (if divisible by 8) vor 2 Jahren
  Tri Dao 0bf5e50038 Release training code vor 2 Jahren
  Tri Dao 43ab0b5205 Mention that some CUDA extensions have only been tested on A100s vor 2 Jahren
  Tri Dao 2e33fc8e36 Add GPT and ViT models vor 2 Jahren
  Tri Dao fa6d1ce44f Add fused_dense and dropout_add_layernorm CUDA extensions vor 2 Jahren