AlpinDale
|
7e72ce0a73
feat: mixtral tensor parallelism (#193)
|
1 year ago |
AlpinDale
|
15a0454172
feat: FP8 KV Cache (#185)
|
1 year ago |
AlpinDale
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
AlpinDale
|
7c6fdea535
fix: GPTQ warnings and exllama states (#171)
|
1 year ago |
AlpinDale
|
02f3ab3501
fix: replace head_mapping with num_kv_heads (#161)
|
1 year ago |
AlpinDale
|
62b2c4119d
feat: re-write GPTQ and refactor exllama kernels (#152)
|
1 year ago |
AlpinDale
|
1334a833a4
feat: AMD ROCm support (#95)
|
1 year ago |
AlpinDale
|
2b1ba581f9
feat: re-implement GPTQ (#141)
|
1 year ago |
AlpinDale
|
8223f85c1b
feat: SqueezeLLM support (#140)
|
1 year ago |
AlpinDale
|
1aab8a7d6f
feat: speedup compilation times by 3x (#130)
|
1 year ago |