.. |
__init__.py
|
98cb1c4cd1
feat: support fp8 via `llm-compressor`
|
4 months ago |
marlin_utils.py
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
4 months ago |
marlin_utils_fp8.py
|
055963b252
fix: channel-wise fp8 marlin
|
4 months ago |
marlin_utils_test.py
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
4 months ago |
marlin_utils_test_24.py
|
058e629f8e
chore: refactor marlin python utils
|
4 months ago |
quant_utils.py
|
ba371fbbbd
feat: AWQ marlin kernels (#603)
|
4 months ago |
w8a8_utils.py
|
5d98b7ead1
fix: input_scale for w8a8 is optional
|
4 months ago |