AlpinDale
|
9a7d5514c4
feat: introduce MQAphroditeEngine (#1056)
|
пре 3 недеља |
AlpinDale
|
638c08d9dc
fix: clean shutdown issues (#1047)
|
пре 4 недеља |
AlpinDale
|
05be6085ec
core: factor out input preprocessing into a separate class (#1039)
|
пре 4 недеља |
AlpinDale
|
ddaefd8d38
chore: remove engine_use_ray (#1024)
|
пре 4 недеља |
AlpinDale
|
f561a54a43
core: fix async postprocessor in case of preemption (#1000)
|
пре 1 месец |
AlpinDale
|
09dab16f82
core: improve async postproc + multi-step performance (#983)
|
пре 1 месец |
AlpinDale
|
0dfa6b60ec
core: support logprobs with multi-step scheduling (#963)
|
пре 1 месец |
AlpinDale
|
5be6225f38
core: support multi-step scheduling w/ async post-processor (#955)
|
пре 1 месец |
AlpinDale
|
0e2bfccda0
core: add virtual engine for async outproc (#939)
|
пре 1 месец |
AlpinDale
|
15cb8d5c26
xpu: support pipeline parallel (#932)
|
пре 1 месец |
AlpinDale
|
f7f3fed265
feat: add async postprocessor (#925)
|
пре 1 месец |
AlpinDale
|
b1492c1529
core: add multi-step scheduling support for the synchronous engine (#914)
|
пре 1 месец |
AlpinDale
|
22a4cd4595
core: fix spec decode metrics and envs circular import (#889)
|
пре 1 месец |
AlpinDale
|
901900854e
chore: consolidate environment variables within one file (#882)
|
пре 1 месец |
AlpinDale
|
b5aa11020b
api: fix crashes under very high loads (#878)
|
пре 1 месец |
AlpinDale
|
5bd4473bb6
async: avoid premature exit in the async generator (#872)
|
пре 1 месец |
AlpinDale
|
48a8693aed
feat: multi-step scheduling (#831)
|
пре 2 месеци |
AlpinDale
|
dcb794a340
fix: revert incorrect commit
|
пре 3 месеци |
AlpinDale
|
76367b5ae7
wip
|
пре 3 месеци |
AlpinDale
|
9d9722b1c1
fix: metrics endpoint with RPC server (#747)
|
пре 4 месеци |
AlpinDale
|
7debd35ca2
fix: shut down ray dag workers cleanly (#692)
|
пре 4 месеци |
AlpinDale
|
19ad952dd4
chore: better stream termination in async engine (#672)
|
пре 4 месеци |
AlpinDale
|
62111fab17
feat: allow serving encoder-decoder models in the API server (#664)
|
пре 4 месеци |
AlpinDale
|
ed9a6f97c1
fix: kill api server when pinging dead engine (#660)
|
пре 4 месеци |
AlpinDale
|
77c4fbd5c9
fix: better async request cancellation (#641)
|
пре 4 месеци |
AlpinDale
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
пре 4 месеци |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
пре 8 месеци |
AlpinDale
|
0f6d56b07f
feat: model executor refactor (#367)
|
пре 9 месеци |
AlpinDale
|
b361096463
fix: tokenizer when using ray (#366)
|
пре 9 месеци |
AlpinDale
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
пре 9 месеци |