AlpinDale 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | il y a 9 mois | |
---|---|---|
.. | ||
LICENSE | 72229a94da feat: better marlin kernels (#285) | il y a 10 mois |
marlin_cuda_kernel.cu | 72229a94da feat: better marlin kernels (#285) | il y a 10 mois |
marlin_cuda_kernel_zero.cu | 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | il y a 9 mois |