AlpinDale
|
141672a0d4
kernels: disambiguate quantized types via a new ScalarType
|
преди 5 месеца |
AlpinDale
|
e3f07b22c3
feat: support for QQQ W4A8 quantization (#612)
|
преди 5 месеца |
AlpinDale
|
598afb63dd
chore: add ignored layers for fp8 quant
|
преди 5 месеца |
AlpinDale
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
преди 5 месеца |
AlpinDale
|
98cb1c4cd1
feat: support fp8 via `llm-compressor`
|
преди 5 месеца |