AlpinDale 92cee435e2 rocm: add more quants, fix _scaled_mm call (#1062) vor 3 Wochen
..
schemes 92cee435e2 rocm: add more quants, fix _scaled_mm call (#1062) vor 3 Wochen
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) vor 4 Monaten
compressed_tensors.py f2b6dc3872 cpu: add support for W8A8 quantization via compressed-tensor (#1017) vor 4 Wochen
compressed_tensors_moe.py 201db10f02 models: add support for Phi3 MoE vor 1 Monat
utils.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) vor 1 Monat