AlpinDale 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ vor 9 Monaten
..
LICENSE 72229a94da feat: better marlin kernels (#285) vor 10 Monaten
marlin_cuda_kernel.cu 72229a94da feat: better marlin kernels (#285) vor 10 Monaten
marlin_cuda_kernel_zero.cu 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ vor 9 Monaten