Commit History

Author SHA1 Message Date
  Zhihao Shen 30e1ef0f79 minify torch.torch.int32 to torch.int32 (#1237) 2 months ago
  Ying Zhang cdbbe844b1 minor changes to unpad_input test util func 3 months ago
  Tri Dao abbc131173 [LayerNorm] Switch from CUDA to Triton implementation 11 months ago
  Kevin Hu 07005806ff Add BigCode converters (#532) 1 year ago
  Kevin Hu 4c91621a5e Inverse state dict for BERT (#527) 1 year ago
  Tri Dao f1a73d0740 Run isort and black on python files 1 year ago
  Kiarash Jamali 684196b8c5 Allow rotary embeddings for Bert (#363) 1 year ago
  Tri Dao 96d10f6545 Implement LLaMa 1 year ago
  Tri Dao 88173a1aaf [FusedDense] Support relu, rename FusedDenseGeluDense -> FusedMLP 1 year ago
  Tri Dao ff34123bd4 Reorder LN in Block, support OPT 1 year ago
  Tri Dao 714c1b4f0f [Bert] Fix embedding layer norm before embedding dropout 1 year ago
  Tri Dao c6ecd40a59 Tweak CrossEntropyLoss to take process_group in init 2 years ago
  Tri Dao dff68c2b22 Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss 2 years ago
  Tri Dao e68ebbe89a Simplify FusedDense 2 years ago
  Tri Dao 13cdceb377 Implement last_layer_subset optimization for BERT 2 years ago
  Tri Dao 5fb6df0e04 Implement BERT 2 years ago