AlpinDale
|
69a4c32b01
fix: openai server (#19)
|
1 year ago |
AlpinDale
|
cbeeabeb9a
feat: mistral support (#20)
|
1 year ago |
AlpinDale
|
8576f8c1f8
fix ctxlen issues with large prompts
|
1 year ago |
AlpinDale
|
ca43123a30
add github action to auto-build wheels
|
1 year ago |
AlpinDale
|
75c27d3e65
massive overhaul
|
1 year ago |
AlpinDale
|
4cdf165ee9
fix engine args
|
1 year ago |
AlpinDale
|
d9c1d4f6e5
add awq support
|
1 year ago |
AlpinDale
|
39beed0b87
Revert "Refactor AWQ support."
|
1 year ago |
AlpinDale
|
d09e27f5d4
Refactor AWQ support.
|
1 year ago |
AlpinDale
|
6b9561ef07
adapt TGI incremental detokenization
|
1 year ago |
AlpinDale
|
d4cd18bd94
chore: allow user to specify model context length
|
1 year ago |
AlpinDale
|
0115e55972
chore: add max log length
|
1 year ago |
AlpinDale
|
45f6d9f923
initial refactor commit
|
1 year ago |
AlpinDale
|
76b2e4a445
Merge dev branch into main (#7)
|
1 year ago |
AlpinDale
|
97bb098066
fix: typo lol
|
1 year ago |
AlpinDale
|
f4bb602b74
chore: remove redundant import and minor refactor
|
1 year ago |
AlpinDale
|
56077f0f29
upstream: trust remote code
|
1 year ago |
AlpinDale
|
7a27bd5f2f
fix: do not allow prompt to exceed max input len
|
1 year ago |
AlpinDale
|
5169163403
chore: add tokenizer mode for slow/fast tokenizers
|
1 year ago |
AlpinDale
|
07aa2a492f
upstream: add option to specify tokenizer
|
1 year ago |
AlpinDale
|
beb966180b
fix: various typo and import error fixes
|
1 year ago |
AlpinDale
|
8f7853c255
fix: typo in args_tools.py
|
1 year ago |
AlpinDale
|
646b514323
feat: add draft for async engine
|
1 year ago |
AlpinDale
|
2e86d50e19
feat: draft for ray
|
1 year ago |