Commit History

Autor SHA1 Mensaxe Data
  Tri Dao bcd918f275 [LayerNorm] Add option to write result to out and residual_out hai 4 meses
  Tri Dao bd82d6c6eb Revert "[LayerNorm] Don't store x + residual if we don't need gradients" hai 4 meses
  Tri Dao 800401847e [LayerNorm] Don't store x + residual if we don't need gradients hai 4 meses
  Tri Dao 36587c01cb [LayerNorm] Update layer_norm_linear hai 9 meses
  Tri Dao bdcae547c7 [LayerNorm] Don't exit early in the backward pass (fix #781) hai 10 meses
  Tri Dao c9861a032d [LayerNorm] Initialize mean and rstd tensor using x.device hai 11 meses
  Tri Dao f5b308e258 [LayerNorm] Rename layernorm.py -> layer_norm.py hai 11 meses