AlpinDale | 50c2434267 move megatron to a top-level directory | 9 months ago
AlpinDale | aa244761ed formatting and typing | 9 months ago
AlpinDale | 41beab5dc1 add exllamav2 tensor parallel, fused MoE for GPTQ/AWQ | 9 months ago
AlpinDale | e42a78381a feat: switch from pylint to ruff (#322) | 10 months ago
AlpinDale | c2d77b1822 chore: logging refactor (#302) | 10 months ago
AlpinDale | 705821a7fe feat: AQLM quantization support (#293) | 10 months ago
AlpinDale | 72229a94da feat: better marlin kernels (#285) | 10 months ago
AlpinDale | ea0f57b233 feat: allow further support for non-cuda devices (#247) | 11 months ago
AlpinDale | c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) | 1 year ago
AlpinDale | 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) | 1 year ago
AlpinDale | 801eda0b7a feat: support GPTQ 2, 3, and 8bit quants (#181) | 1 year ago
AlpinDale | 7c6fdea535 fix: GPTQ warnings and exllama states (#171) | 1 year ago
AlpinDale | 62b2c4119d feat: re-write GPTQ and refactor exllama kernels (#152) | 1 year ago
AlpinDale | 2b1ba581f9 feat: re-implement GPTQ (#141) | 1 year ago
AlpinDale | e7b6a2d5a0 chore: tensor parallel refactors part 2 (#116) | 1 year ago