sgsdxzy
|
589fe0c73e
fix: split the exl2 weight loading and SQ+ init (#423)
|
8 months ago |
AlpinDale
|
50c2434267
move megatron to a top-level directory
|
9 months ago |
AlpinDale
|
41beab5dc1
add exllamav2 tensor parallel, fused MoE for GPTQ/AWQ
|
9 months ago |
AlpinDale
|
7b9c08afae
vision model support
|
9 months ago |
AlpinDale
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
9 months ago |
AlpinDale
|
c2d77b1822
chore: logging refactor (#302)
|
10 months ago |
AlpinDale
|
e0c35bb353
feat: bitsandbytes and `--load-in{4,8}bit` support (#294)
|
10 months ago |
AlpinDale
|
ac82b67f75
feat: naive context shift and various QoL changes (#289)
|
10 months ago |
AlpinDale
|
697c06c4f5
fix: LoRA support for mixtral (#276)
|
10 months ago |
AlpinDale
|
224b87b484
feat: add fused mixtral moe support (#238)
|
10 months ago |
AlpinDale
|
ea0f57b233
feat: allow further support for non-cuda devices (#247)
|
11 months ago |
AlpinDale
|
4faf78ba29
fix: grab correct quant config from revisions (#246)
|
11 months ago |
AlpinDale
|
c0aac15421
feat: S-LoRA support (#222)
|
11 months ago |
AlpinDale
|
f013d714c0
chore: merge dev branch into main (#177)
|
1 year ago |
AlpinDale
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
AlpinDale
|
989df2d84b
add Yi to quantized models
|
1 year ago |
AlpinDale
|
e59e7f0a99
feat: yi support (#104)
|
1 year ago |
AlpinDale
|
74604eb252
fix: pylint complaints (#91)
|
1 year ago |
AlpinDale
|
efc6f7fbec
chore: reformats (#90)
|
1 year ago |
AlpinDale
|
f73f2dd3c2
feat: add mistral support for GPTQ (#86)
|
1 year ago |
AlpinDale
|
f04588203e
feat: mistral AWQ support and file blacklisting
|
1 year ago |
AlpinDale
|
0495c50a3e
GPTQ+exllama support (#21)
|
1 year ago |
AlpinDale
|
cbeeabeb9a
feat: mistral support (#20)
|
1 year ago |
AlpinDale
|
1ab15649ae
fix column parallelism in quantized layers
|
1 year ago |
AlpinDale
|
75c27d3e65
massive overhaul
|
1 year ago |
AlpinDale
|
c95d80da39
fix revision issues
|
1 year ago |
AlpinDale
|
0a54cd7e26
dict to list
|
1 year ago |
AlpinDale
|
d9c1d4f6e5
add awq support
|
1 year ago |
AlpinDale
|
58c11e2178
remove revision from the loader
|
1 year ago |
AlpinDale
|
39beed0b87
Revert "Refactor AWQ support."
|
1 year ago |