Historique des commits

Auteur SHA1 Message Date
  Tri Dao 27f8f890df [FusedDense] Allocate lt_workspace on input device il y a 1 an
  Tri Dao dec4f2e910 [FusedDense] Set workspace size to 32M for Hopper and 4M for others il y a 1 an
  Tri Dao 88173a1aaf [FusedDense] Support relu, rename FusedDenseGeluDense -> FusedMLP il y a 1 an
  Tri Dao e68ebbe89a Simplify FusedDense il y a 2 ans
  Tri Dao fa6d1ce44f Add fused_dense and dropout_add_layernorm CUDA extensions il y a 2 ans