Commit History

Author SHA1 Message Date
  AlpinDale ba371fbbbd feat: AWQ marlin kernels (#603) 4 months ago
  AlpinDale 9be43994fe feat: fbgemm quantization support (#601) 4 months ago
  AlpinDale 7e9d4f3c71 chore: some more marlin cleanups 4 months ago
  AlpinDale 058e629f8e chore: refactor marlin python utils 4 months ago
  AlpinDale 98cb1c4cd1 feat: support fp8 via `llm-compressor` 4 months ago
  AlpinDale cda0e93a10 abstract away the platform for device capability 4 months ago
  AlpinDale 0f4a9ee77b quantized lm_head (#582) 4 months ago
  AlpinDale 7d79c0e726 chore: use nvml query to avoid accidental cuda initialization 4 months ago
  AlpinDale 156f577f79 feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569) 5 months ago
  AlpinDale 5cedee9024 fix gemma with gptq marlin 5 months ago
  AlpinDale 5b0c11d190 support pipeline parallel pynccl groups 5 months ago
  AlpinDale c66b1b57b1 Marlin 2:4 sparsity (#555) 5 months ago
  AlpinDale ad1c6b86a1 gptq_marlin: enable bfloat16 5 months ago
  AlpinDale c154578c97 gptq_marlin: 8bit GPTQ support 5 months ago
  AlpinDale ac5b4b6aa7 broadcast metadata through cpu 5 months ago
  AlpinDale f22b700ee4 feat: marlin kernels for GPTQ (#547) 5 months ago