Histórico de Commits

Autor SHA1 Mensagem Data
  AlpinDale 9be43994fe feat: fbgemm quantization support (#601) há 5 meses atrás
  AlpinDale 00503b9fc1 feat: non-uniform quantization via `compressed-tensors` for llama há 5 meses atrás
  AlpinDale e1475fbec7 feat: MoE support with Pallas GMM kernel for TPUs há 6 meses atrás
  AlpinDale 4bbf66451a chore: add CustomAP interface to UnquantizedFusedMoEMethod há 6 meses atrás
  AlpinDale 1efd0f89b7 feat: support FP8 for DeepSeekV2 MoE há 6 meses atrás
  AlpinDale cf472315cc refactor: isolate FP8 from mixtral há 6 meses atrás