AlpinDale
|
0495c50a3e
GPTQ+exllama support (#21)
|
1 year ago |
AlpinDale
|
cbeeabeb9a
feat: mistral support (#20)
|
1 year ago |
AlpinDale
|
8576f8c1f8
fix ctxlen issues with large prompts
|
1 year ago |
AlpinDale
|
75c27d3e65
massive overhaul
|
1 year ago |
AlpinDale
|
0338f852be
fix incorrect attribute name
|
1 year ago |
AlpinDale
|
d9c1d4f6e5
add awq support
|
1 year ago |
AlpinDale
|
39beed0b87
Revert "Refactor AWQ support."
|
1 year ago |
AlpinDale
|
d09e27f5d4
Refactor AWQ support.
|
1 year ago |
AlpinDale
|
6b9561ef07
adapt TGI incremental detokenization
|
1 year ago |
AlpinDale
|
d4cd18bd94
chore: allow user to specify model context length
|
1 year ago |
AlpinDale
|
45f6d9f923
initial refactor commit
|
1 year ago |
AlpinDale
|
76b2e4a445
Merge dev branch into main (#7)
|
1 year ago |
AlpinDale
|
84d08d84ea
chore: minor refactoring and changes to the config
|
1 year ago |
AlpinDale
|
2cdfc45a40
fix: trust_remote_code fixes
|
1 year ago |
AlpinDale
|
56077f0f29
upstream: trust remote code
|
1 year ago |
AlpinDale
|
7a27bd5f2f
fix: do not allow prompt to exceed max input len
|
1 year ago |
AlpinDale
|
5169163403
chore: add tokenizer mode for slow/fast tokenizers
|
1 year ago |
AlpinDale
|
07aa2a492f
upstream: add option to specify tokenizer
|
1 year ago |
AlpinDale
|
fefbf029c9
revert previous commit
|
1 year ago |
AlpinDale
|
964ac344b2
Deploying to main from @ PygmalionAI/aphrodite-engine@9ae65dd2fe38acf8186d4a8d9ea3e54fc8e523e9 🚀
|
1 year ago |
AlpinDale
|
b02c4f6060
chore: re-arranging
|
1 year ago |