AlpinDale
|
edec2e9a9e
feat: migrate awq and awq_marlin to AphroditeParameter (#702)
|
4 months ago |
AlpinDale
|
4f6020cc86
chore: migrate gptq_marlin to AphroditeParameters (#699)
|
4 months ago |
AlpinDale
|
df208ab4e9
fix: fp8 checkpoints with fused linear modules (#677)
|
4 months ago |
AlpinDale
|
f1e1d0bd3d
feat: introduce `BaseAphroditeParameter` (#646)
|
4 months ago |
AlpinDale
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
AlpinDale
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
10 months ago |
AlpinDale
|
c2d77b1822
chore: logging refactor (#302)
|
10 months ago |
AlpinDale
|
705821a7fe
feat: AQLM quantization support (#293)
|
10 months ago |
AlpinDale
|
72229a94da
feat: better marlin kernels (#285)
|
10 months ago |
AlpinDale
|
ea0f57b233
feat: allow further support for non-cuda devices (#247)
|
11 months ago |
AlpinDale
|
c3a221eb02
feat: GGUF, QuIP#, and Marlin support (#228)
|
11 months ago |
AlpinDale
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 months ago |
AlpinDale
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
AlpinDale
|
7c6fdea535
fix: GPTQ warnings and exllama states (#171)
|
1 year ago |
AlpinDale
|
62b2c4119d
feat: re-write GPTQ and refactor exllama kernels (#152)
|
1 year ago |
AlpinDale
|
2b1ba581f9
feat: re-implement GPTQ (#141)
|
1 year ago |
AlpinDale
|
e7b6a2d5a0
chore: tensor parallel refactors part 2 (#116)
|
1 year ago |