AlpinDale 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | 9 months ago | |
---|---|---|
.. | ||
dequantize.cuh | 9f7a0e3ecb feat: AWQ support for Turing GPUs (#53) | 1 year ago |
gemm_kernels.cu | 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | 9 months ago |