AlpinDale
|
141672a0d4
kernels: disambiguate quantized types via a new ScalarType
|
5 months ago |
AlpinDale
|
6979ff658e
chore: perform allreduce in fp32 for marlin, better logging
|
5 months ago |
AlpinDale
|
84a9cd25c9
fix: some naming issues
|
5 months ago |
AlpinDale
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
5 months ago |
AlpinDale
|
7e9d4f3c71
chore: some more marlin cleanups
|
5 months ago |
AlpinDale
|
058e629f8e
chore: refactor marlin python utils
|
5 months ago |
AlpinDale
|
88a638d793
chore: debug logs for all available endpoints
|
5 months ago |
AlpinDale
|
98cb1c4cd1
feat: support fp8 via `llm-compressor`
|
5 months ago |