Commit History

Author SHA1 Message Date
  AlpinDale e14223dce5 kernel: use `cub::BlockReduce` instead of custom impl (#895) 3 weeks ago
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 11 months ago
  AlpinDale b9b295d74e chore: backlogs 1 (#191) 1 year ago
  AlpinDale 7612f33afd feat: fused add RMSNorm kernels (#125) 1 year ago
  AlpinDale 3d72f05c7b feat: flattened 1D tensor -> 2D tensor (#85) 1 year ago
  AlpinDale 32844c1522 add GELU kernels and remove compile bloat 1 year ago
  AlpinDale 081545bde6 fix: various CUDA kernel tweaks 1 year ago
  AlpinDale b8f4337c5b chore: various fixes 1 year ago
  AlpinDale 0ec53128b6 feat: add layernorm kernels 1 year ago