Commit History

Author   SHA1        Message                                                             Date
Tri Dao  dff68c2b22  Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss  2 years ago
Tri Dao  e68ebbe89a  Simplify FusedDense                                                 2 years ago
Tri Dao  13cdceb377  Implement last_layer_subset optimization for BERT                   2 years ago
Tri Dao  5fb6df0e04  Implement BERT                                                      2 years ago