Historique des commits

Auteur SHA1 Message Date
  Tri Dao c3b2196652 Add Alibi to MHA, test with Baichuan-13B il y a 1 an
  Tri Dao 3557e0bb8f [MLP] Implement SwiGLU with torch jiterator il y a 1 an
  Tri Dao f1a73d0740 Run isort and black on python files il y a 1 an
  Tri Dao 364a5b4a71 [MLP] Change the check for out_features being None il y a 1 an
  Tri Dao 4c98d0b41f [MLP] Edit ParallelGatedMlp il y a 1 an
  Haodong Lyu 8ee62efca3 Implement ParallelGatedMlp (#251) il y a 1 an
  Tri Dao 75e334d407 [MLP] Add ParallelMLP il y a 1 an
  Tri Dao 96d10f6545 Implement LLaMa il y a 1 an
  Tri Dao b630aef53f Implement GatedMlp il y a 1 an
  Zhiyuan Chen 8c42415664 make mlp hidden_features defaults to 4*in_features il y a 1 an
  Tri Dao 88173a1aaf [FusedDense] Support relu, rename FusedDenseGeluDense -> FusedMLP il y a 1 an
  Tri Dao 226a1b721d Implement TensorParallel for FusedDense and FusedDenseGeluDense il y a 2 ans
  Tri Dao e68ebbe89a Simplify FusedDense il y a 2 ans
  Tri Dao 13cdceb377 Implement last_layer_subset optimization for BERT il y a 2 ans
  Tri Dao 1feb94265c [ViT] Use dropout_add_ln for the 1st layer norm il y a 2 ans
  Tri Dao d4b320b31f Add MLP, MHA, Block, Embedding modules il y a 2 ans