AlpinDale ba371fbbbd feat: AWQ marlin kernels (#603) 4 months ago
..
__init__.py 98cb1c4cd1 feat: support fp8 via `llm-compressor` 4 months ago
marlin_utils.py ba371fbbbd feat: AWQ marlin kernels (#603) 4 months ago
marlin_utils_fp8.py 055963b252 fix: channel-wise fp8 marlin 4 months ago
marlin_utils_test.py ba371fbbbd feat: AWQ marlin kernels (#603) 4 months ago
marlin_utils_test_24.py 058e629f8e chore: refactor marlin python utils 4 months ago
quant_utils.py ba371fbbbd feat: AWQ marlin kernels (#603) 4 months ago
w8a8_utils.py 5d98b7ead1 fix: input_scale for w8a8 is optional 4 months ago