AlpinDale
|
63c28919a0
Revert "fix: correct auto ntk scaling_factor for 4k ctx case" (#149)
|
1 year ago |
AlpinDale
|
2b1ba581f9
feat: re-implement GPTQ (#141)
|
1 year ago |
AlpinDale
|
8223f85c1b
feat: SqueezeLLM support (#140)
|
1 year ago |
AlpinDale
|
237d2ec28d
fix: CPU OOM for large models (#128)
|
1 year ago |
AlpinDale
|
0d51eac374
feat: awq for all models (#124)
|
1 year ago |
AlpinDale
|
fd18a1d956
fix: get_tensor instead of pysafeslice
|
1 year ago |
AlpinDale
|
5ea6889cea
chore: read from quantization_config (#123)
|
1 year ago |
AlpinDale
|
3459f1c185
feat: usage stats for OpenAI endpoint (#122)
|
1 year ago |
AlpinDale
|
1323b5456c
parse torch.dtype properly (#119)
|
1 year ago |
AlpinDale
|
e7b6a2d5a0
chore: tensor parallel refactors part 2 (#116)
|
1 year ago |
AlpinDale
|
5175605f8d
fix: yarn (#112)
|
1 year ago |
sandwichdoge
|
99293aaff0
fix: correct auto ntk scaling_factor for 4k ctx case (#101)
|
1 year ago |
AlpinDale
|
8834ecf9de
chore: clean up refactor endpoints (#98)
|
1 year ago |
AlpinDale
|
74604eb252
fix: pylint complaints (#91)
|
1 year ago |
AlpinDale
|
efc6f7fbec
chore: reformats (#90)
|
1 year ago |
AlpinDale
|
3d72f05c7b
feat: flattened 1D tensor -> 2D tensor (#85)
|
1 year ago |
AlpinDale
|
c1fa7e8567
chore: fix datatype check (#65)
|
1 year ago |
AlpinDale
|
0495c50a3e
GPTQ+exllama support (#21)
|
1 year ago |
AlpinDale
|
cbeeabeb9a
feat: mistral support (#20)
|
1 year ago |
AlpinDale
|
8576f8c1f8
fix ctxlen issues with large prompts
|
1 year ago |
AlpinDale
|
75c27d3e65
massive overhaul
|
1 year ago |
AlpinDale
|
0338f852be
fix incorrect attribute name
|
1 year ago |
AlpinDale
|
d9c1d4f6e5
add awq support
|
1 year ago |
AlpinDale
|
39beed0b87
Revert "Refactor AWQ support."
|
1 year ago |
AlpinDale
|
d09e27f5d4
Refactor AWQ support.
|
1 year ago |
AlpinDale
|
6b9561ef07
adapt TGI incremental detokenization
|
1 year ago |
AlpinDale
|
d4cd18bd94
chore: allow user to specify model context length
|
1 year ago |
AlpinDale
|
45f6d9f923
initial refactor commit
|
1 year ago |
AlpinDale
|
76b2e4a445
Merge dev branch into main (#7)
|
1 year ago |
AlpinDale
|
84d08d84ea
chore: minor refactoring and changes to the config
|
1 year ago |