Commit history

Author SHA1 Message Date
  Tri Dao f1a73d0740 Run isort and black on python files 1 year ago
  Xuechen Li bb4cded17b support when num_heads is not divisible by world_size; resolves #459 (#461) 1 year ago
  Tri Dao 93383bd55b [TP] Implement TensorParallel without sequence parallel 1 year ago
  Tri Dao c6ecd40a59 Tweak CrossEntropyLoss to take process_group in init 1 year ago
  Tri Dao b4018a5028 Implement Tensor Parallel for GPT model 2 years ago
  Tri Dao 226a1b721d Implement TensorParallel for FusedDense and FusedDenseGeluDense 2 years ago