AlpinDale 41beab5dc1 add exllamav2 tensor parallel, fused MoE for GPTQ/AWQ 9 months ago
..
dequantize.cuh 9f7a0e3ecb feat: AWQ support for Turing GPUs (#53) 1 year ago
gemm_kernels.cu 41beab5dc1 add exllamav2 tensor parallel, fused MoE for GPTQ/AWQ 9 months ago