autogptq_cuda_256.cpp
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
autogptq_cuda_64.cpp
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
autogptq_cuda_kernel_256.cu
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
autogptq_cuda_kernel_64.cu
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
compat.cuh
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
matrix_view.cuh
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 months ago |
q_gemm.cu
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
5 months ago |
qdq_2.cuh
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
qdq_3.cuh
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
qdq_4.cuh
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 months ago |
qdq_8.cuh
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
qdq_util.cuh
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |