AlpinDale 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | 9 months ago | |
---|---|---|
.. | ||
dequantize.cuh | 9f7a0e3ecb feat: AWQ support for Turing GPUs (#53) | 1 year ago |
gemm_kernels.cu | 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | 9 months ago |