Commit History

Author SHA1 Message Date
  AlpinDale 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
  AlpinDale bfc8988116 feat: add cuda sampling kernels for top_k and top_p (#828) 1 month ago
  AlpinDale 5b0eabe0e8 fix: compilation of gptq_marlin_gemm object (#800) 2 months ago
  AlpinDale f98e7b2f8c feat: add HQQ quantization support (#795) 2 months ago
  AlpinDale 0256ed236b feat: windows support (#790) 2 months ago
  Naomiusearch eee3cf5dab fix: make AMD usable (#775) 2 months ago
  AlpinDale 73177656ed feat: quant_llm support (#755) 3 months ago
  AlpinDale a401f8e05d feat: per-tensor token epilogue kernels (#630) 4 months ago
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago