AlpinDale | c2d77b1822 chore: logging refactor (#302) | 10 months ago
AlpinDale | 2d3d44b3e9 chore: add health check for ray workers (#290) | 10 months ago
AlpinDale | ac82b67f75 feat: naive context shift and various QoL changes (#289) | 10 months ago
AlpinDale | 657aec0cbd refactor: OpenAI endpoint (#261) | 10 months ago
AlpinDale | 6a63ab4ec3 fix: remote encode request if using ray (#270) | 10 months ago
AlpinDale | c0146ed00e chore: slight refactor for async engine finish (#225) | 11 months ago
AlpinDale | c0aac15421 feat: S-LoRA support (#222) | 11 months ago
AlpinDale | 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) | 11 months ago
AlpinDale | b9b295d74e chore: backlogs 1 (#191) | 1 year ago
AlpinDale | 980673ffb7 fix: fractional gpus (#157) | 1 year ago
AlpinDale | e7b6a2d5a0 chore: tensor parallel refactors part 2 (#116) | 1 year ago
AlpinDale | 035878898f bug: minor ray issue | 1 year ago
AlpinDale | 74604eb252 fix: pylint complaints (#91) | 1 year ago
AlpinDale | efc6f7fbec chore: reformats (#90) | 1 year ago
AlpinDale | 75c27d3e65 massive overhaul | 1 year ago
AlpinDale | c8c0b2f369 fix exception error for async | 1 year ago
AlpinDale | 0115e55972 chore: add max log length | 1 year ago
AlpinDale | 45f6d9f923 initial refactor commit | 1 year ago
AlpinDale | 76b2e4a445 Merge dev branch into main (#7) | 1 year ago
AlpinDale | 90f0b4a47e chore: minor tweaks and fixes to the async engine | 1 year ago
AlpinDale | a69f1ecf51 chore: qol improvements | 1 year ago
AlpinDale | 724852dc31 chore: refactoring cont. | 1 year ago
AlpinDale | dd1dd2fdbd fix: endless loop in async engine | 1 year ago
AlpinDale | beb966180b fix: various typo and import error fixes | 1 year ago
AlpinDale | 646b514323 feat: add draft for async engine | 1 year ago