Commit History

Autor SHA1 Mensaxe Data
  AlpinDale 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) hai 5 meses
  AlpinDale aba03b4756 feat: dynamic per-token activation quantization hai 6 meses
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) hai 10 meses
  AlpinDale 9810daa699 feat: INT8 KV Cache (#298) hai 11 meses
  AlpinDale d9b65e6c5f feat: DeepSeek MoE support (#237) hai 1 ano
  AlpinDale 31c95011a6 feat: FP8 E5M2 KV Cache (#226) hai 1 ano
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) hai 1 ano
  AlpinDale 7e72ce0a73 feat: mixtral tensor parallelism (#193) hai 1 ano
  AlpinDale 15a0454172 feat: FP8 KV Cache (#185) hai 1 ano
  AlpinDale 32844c1522 add GELU kernels and remove compile bloat hai 1 ano