Histórico de Commits

Autor SHA1 Mensagem Data
  AlpinDale 5d98b7ead1 fix: input_scale for w8a8 is optional há 5 meses atrás
  AlpinDale 9be43994fe feat: fbgemm quantization support (#601) há 5 meses atrás
  AlpinDale d3c474d219 chore: enable dynamic per-token `fp8` há 5 meses atrás
  AlpinDale e90ad4acec chore: implement fallback for fp8 channelwise using torch._scaled_mm há 5 meses atrás
  AlpinDale b5d23ab6d4 chore: enable bias w/ FP8 layers in CUTLASS kernels há 5 meses atrás
  AlpinDale 500f3b654f fix: support bias term in compressed-tensors quant há 5 meses atrás
  AlpinDale 98cb1c4cd1 feat: support fp8 via `llm-compressor` há 5 meses atrás