AlpinDale 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ 9 months ago
..
quant c41462cfcd feat: exllamav2 quantization (#305) 10 months ago
compat.cuh c41462cfcd feat: exllamav2 quantization (#305) 10 months ago
matrix_view.cuh c41462cfcd feat: exllamav2 quantization (#305) 10 months ago
q_gemm_exl2.cu 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ 9 months ago
q_gemm_kernel.cuh 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ 9 months ago
q_matrix.cu 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ 9 months ago
q_matrix.cuh c41462cfcd feat: exllamav2 quantization (#305) 10 months ago