AlpinDale
|
141672a0d4
kernels: disambiguate quantized types via a new ScalarType
|
5 months ago |
AlpinDale
|
49a2836d61
fix: divide-by-zero warnings in marlin kernels
|
5 months ago |
AlpinDale
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
6 months ago |
AlpinDale
|
d8667fcb98
improve gptq_marlin_24 prefill performance
|
6 months ago |
AlpinDale
|
3bdeb3e116
fix: clang formatting for all kernels (#558)
|
6 months ago |
AlpinDale
|
c66b1b57b1
Marlin 2:4 sparsity (#555)
|
6 months ago |