| quant | c41462cfcd | feat: exllamav2 quantization (#305) | 10 months ago |
| compat.cuh | c41462cfcd | feat: exllamav2 quantization (#305) | 10 months ago |
| matrix_view.cuh | c41462cfcd | feat: exllamav2 quantization (#305) | 10 months ago |
| q_gemm_exl2.cu | 41beab5dc1 | add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | 9 months ago |
| q_gemm_kernel.cuh | 41beab5dc1 | add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | 9 months ago |
| q_matrix.cu | 41beab5dc1 | add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ | 9 months ago |
| q_matrix.cuh | c41462cfcd | feat: exllamav2 quantization (#305) | 10 months ago |