AlpinDale 92cee435e2 rocm: add more quants, fix _scaled_mm call (#1062) 1 week ago
schemes 92cee435e2 rocm: add more quants, fix _scaled_mm call (#1062) 1 week ago
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
compressed_tensors.py f2b6dc3872 cpu: add support for W8A8 quantization via compressed-tensor (#1017) 1 week ago
compressed_tensors_moe.py 201db10f02 models: add support for Phi3 MoE 2 weeks ago
utils.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago