Commit History

Autor SHA1 Mensaxe Data
  Kevin Hu 07005806ff Add BigCode converters (#532) hai 1 ano
  Kevin Hu 4c91621a5e Inverse state dict for BERT (#527) hai 1 ano
  Tri Dao ef6d8c75d9 [GPT] Fix loading weights from HF hub hai 1 ano
  Tri Dao 0e8c46ae08 Run isort and black on test files hai 1 ano
  Tri Dao 88173a1aaf [FusedDense] Support relu, rename FusedDenseGeluDense -> FusedMLP %!s(int64=2) %!d(string=hai) anos
  Tri Dao c6ecd40a59 Tweak CrossEntropyLoss to take process_group in init %!s(int64=2) %!d(string=hai) anos
  Tri Dao 13cdceb377 Implement last_layer_subset optimization for BERT %!s(int64=2) %!d(string=hai) anos
  Tri Dao 5fb6df0e04 Implement BERT %!s(int64=2) %!d(string=hai) anos