Revīziju vēsture

Autors SHA1 Ziņojums Datums
  Tri Dao 8c20cfef49 [Rotary] Support qkv block layout from GQA 3 mēneši atpakaļ
  Tri Dao c7f32a8409 [CrossEntropy] Support precomputed LSE 4 mēneši atpakaļ
  Tri Dao d79f9b41a8 [CrossEntropy] Use online softmax to simplify implementation 4 mēneši atpakaļ
  lancerts 22339db185 remove an unused import (#960) 7 mēneši atpakaļ
  Tri Dao ec6d22143b [CrossEntropy] Change ignored_index -> ignore_index 8 mēneši atpakaļ
  Curtis "Fjord" Hawthorne d8aacc510c return z_loss (#768) 11 mēneši atpakaļ
  Tri Dao 08124c8f9c [CrossEntropy] Implement logit_scale option 1 gadu atpakaļ
  Tri Dao aaa1474129 [CrossEntropy] Simplify the case of large vocab with Tensor Parallel 1 gadu atpakaļ
  Shijie abf04a56e1 fix flash ce mp large vocab (#673) 1 gadu atpakaļ
  Tri Dao c79de85ffa [CrossEntropy] Fix triton cross_entropy_loss IMA for >=2B elements 1 gadu atpakaļ
  Tri Dao 5400fdc4ac [CE] Implement CrossEntropyLoss in Triton 1 gadu atpakaļ