Commit History

Autor SHA1 Mensaxe Data
  Tri Dao c7f32a8409 [CrossEntropy] Support precomputed LSE hai 3 meses
  Tri Dao 08124c8f9c [CrossEntropy] Implement logit_scale option hai 1 ano
  Tri Dao aaa1474129 [CrossEntropy] Simplify the case of large vocab with Tensor Parallel hai 1 ano
  Tri Dao 5400fdc4ac [CE] Implement CrossEntropyLoss in Triton hai 1 ano
  Tri Dao 0e8c46ae08 Run isort and black on test files hai 1 ano
  Tri Dao c6ecd40a59 Tweak CrossEntropyLoss to take process_group in init hai 1 ano
  Tri Dao b4018a5028 Implement Tensor Parallel for GPT model %!s(int64=2) %!d(string=hai) anos
  Tri Dao dff68c2b22 Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss %!s(int64=2) %!d(string=hai) anos
  Tri Dao 343492ec30 Make nccl operations async in CrossEntropyLossParallel %!s(int64=2) %!d(string=hai) anos
  Tri Dao 7c9953815a Add fused cross entropy loss %!s(int64=2) %!d(string=hai) anos