Commit History

| Author | SHA1 | Message | Date |
|---|---|---|---|
| AlpinDale | 141672a0d4 | kernels: disambiguate quantized types via a new ScalarType | 5 months ago |
| AlpinDale | 6979ff658e | chore: perform allreduce in fp32 for marlin, better logging | 5 months ago |
| AlpinDale | 84a9cd25c9 | fix: some naming issues | 5 months ago |
| AlpinDale | ba371fbbbd | feat: AWQ marlin kernels (#603) | 5 months ago |
| AlpinDale | 7e9d4f3c71 | chore: some more marlin cleanups | 5 months ago |
| AlpinDale | 058e629f8e | chore: refactor marlin python utils | 5 months ago |
| AlpinDale | 88a638d793 | chore: debug logs for all available endpoints | 5 months ago |
| AlpinDale | 98cb1c4cd1 | feat: support fp8 via `llm-compressor` | 5 months ago |