Tri Dao | c7f32a8409 | [CrossEntropy] Support precomputed LSE | 3 months ago
Tri Dao | ec6d22143b | [CrossEntropy] Change ignored_index -> ignore_index | 7 months ago
Curtis "Fjord" Hawthorne | d8aacc510c | return z_loss (#768) | 10 months ago
Tri Dao | 08124c8f9c | [CrossEntropy] Implement logit_scale option | 1 year ago
Tri Dao | 5400fdc4ac | [CE] Implement CrossEntropyLoss in Triton | 1 year ago
Tri Dao | f1a73d0740 | Run isort and black on python files | 1 year ago
Tri Dao | c6ecd40a59 | Tweak CrossEntropyLoss to take process_group in init | 1 year ago
Tri Dao | dff68c2b22 | Add smoothing for CrossEntropyParallel, rename to CrossEntropyLoss | 2 years ago