AlpinDale 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | 10 달 전 | |
---|---|---|
.. | ||
LICENSE | 72229a94da feat: better marlin kernels (#285) | 11 달 전 |
marlin_cuda_kernel.cu | 72229a94da feat: better marlin kernels (#285) | 11 달 전 |
marlin_cuda_kernel_zero.cu | 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | 10 달 전 |