.. |
autogptq_cuda_256.cpp
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
autogptq_cuda_64.cpp
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
autogptq_cuda_kernel_256.cu
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
autogptq_cuda_kernel_64.cu
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
compat.cuh
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
matrix_view.cuh
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 months ago |
q_gemm.cu
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
qdq_2.cuh
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
qdq_3.cuh
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
qdq_4.cuh
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 months ago |
qdq_8.cuh
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
qdq_util.cuh
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |