AlpinDale
|
9d9722b1c1
fix: metrics endpoint with RPC server (#747)
|
3 months ago |
AlpinDale
|
7debd35ca2
fix: shut down ray dag workers cleanly (#692)
|
4 months ago |
AlpinDale
|
19ad952dd4
chore: better stream termination in async engine (#672)
|
4 months ago |
AlpinDale
|
62111fab17
feat: allow serving encoder-decoder models in the API server (#664)
|
4 months ago |
AlpinDale
|
ed9a6f97c1
fix: kill api server when pinging dead engine (#660)
|
4 months ago |
AlpinDale
|
77c4fbd5c9
fix: better async request cancellation (#641)
|
4 months ago |
AlpinDale
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
AlpinDale
|
0f6d56b07f
feat: model executor refactor (#367)
|
9 months ago |
AlpinDale
|
b361096463
fix: tokenizer when using ray (#366)
|
9 months ago |
AlpinDale
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
9 months ago |
AlpinDale
|
c2d77b1822
chore: logging refactor (#302)
|
10 months ago |
AlpinDale
|
2d3d44b3e9
chore: add health check for ray workers (#290)
|
10 months ago |
AlpinDale
|
ac82b67f75
feat: naive context shift and various QoL changes (#289)
|
10 months ago |
AlpinDale
|
657aec0cbd
refactor: OpenAI endpoint (#261)
|
10 months ago |
AlpinDale
|
6a63ab4ec3
fix: remote encode request if using ray (#270)
|
10 months ago |
AlpinDale
|
c0146ed00e
chore: slight refactor for async engine finish (#225)
|
11 months ago |
AlpinDale
|
c0aac15421
feat: S-LoRA support (#222)
|
11 months ago |
AlpinDale
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 months ago |
AlpinDale
|
b9b295d74e
chore: backlogs 1 (#191)
|
1 year ago |
AlpinDale
|
980673ffb7
fix: fractional gpus (#157)
|
1 year ago |
AlpinDale
|
e7b6a2d5a0
chore: tensor parallel refactors part 2 (#116)
|
1 year ago |
AlpinDale
|
035878898f
bug: minor ray issue
|
1 year ago |
AlpinDale
|
74604eb252
fix: pylint complaints (#91)
|
1 year ago |
AlpinDale
|
efc6f7fbec
chore: reformats (#90)
|
1 year ago |
AlpinDale
|
75c27d3e65
massive overhaul
|
1 year ago |
AlpinDale
|
c8c0b2f369
fix exception error for async
|
1 year ago |
AlpinDale
|
0115e55972
chore: add max log length
|
1 year ago |
AlpinDale
|
45f6d9f923
initial refactor commit
|
1 year ago |
AlpinDale
|
76b2e4a445
Merge dev branch into main (#7)
|
1 year ago |