AlpinDale
|
b178ae4b4a
chore: generalize linear_method to be quant_method (#540)
|
il y a 5 mois |
AlpinDale
|
fca911ee0a
vLLM Upstream Sync (#526)
|
il y a 6 mois |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
il y a 8 mois |
AlpinDale
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
il y a 10 mois |
AlpinDale
|
c2d77b1822
chore: logging refactor (#302)
|
il y a 10 mois |
AlpinDale
|
705821a7fe
feat: AQLM quantization support (#293)
|
il y a 10 mois |
AlpinDale
|
72229a94da
feat: better marlin kernels (#285)
|
il y a 10 mois |
AlpinDale
|
ea0f57b233
feat: allow further support for non-cuda devices (#247)
|
il y a 11 mois |
AlpinDale
|
c3a221eb02
feat: GGUF, QuIP#, and Marlin support (#228)
|
il y a 11 mois |
AlpinDale
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
il y a 11 mois |
AlpinDale
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
il y a 1 an |
AlpinDale
|
7c6fdea535
fix: GPTQ warnings and exllama states (#171)
|
il y a 1 an |
AlpinDale
|
62b2c4119d
feat: re-write GPTQ and refactor exllama kernels (#152)
|
il y a 1 an |
AlpinDale
|
2b1ba581f9
feat: re-implement GPTQ (#141)
|
il y a 1 an |
AlpinDale
|
e7b6a2d5a0
chore: tensor parallel refactors part 2 (#116)
|
il y a 1 an |