Commit History

Author SHA1 Message Date
  AlpinDale 201db10f02 models: add support for Phi3 MoE 2 weeks ago
  AlpinDale 9f3e7c86e2 feat: add fused Marlin MoE kernel (#934) 2 weeks ago
  AlpinDale afc9a28aa0 chore: add AphroditeParameter support for FP8 quant (#902) 3 weeks ago
  AlpinDale 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 weeks ago
  AlpinDale 901900854e chore: consolidate environment variables within one file (#882) 4 weeks ago
  AlpinDale d34e083c48 feat: add experts_int8 support (#730) 3 months ago
  AlpinDale b0f262eec1 feat: FP8 quantization support for AMD ROCm (#729) 3 months ago
  AlpinDale 4ec08af18b chore: update fused MoE weight loading (#700) 3 months ago
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago