AlpinDale
|
45a004874c
chore: allow specifying custom Executor
|
5 months ago |
AlpinDale
|
b7a2d52e47
fix: allow using mp executor for pipeline parallel
|
5 months ago |
AlpinDale
|
052a6e1eb6
feat: add SPMD worker execution using Ray accelerated DAG
|
5 months ago |
AlpinDale
|
6f8beb8583
fix: 4-node crash with PP
|
5 months ago |
AlpinDale
|
23408b9b2b
chore: skip the driver worker
|
5 months ago |
AlpinDale
|
1562e073c6
fix: ray worker rank assigment
|
5 months ago |
AlpinDale
|
4c3bb0b436
fix: pipeline parallel on python 3.8 and 3.9
|
5 months ago |
AlpinDale
|
5257ebce8c
fix: device >= 0 && device < num_gpus INTERNAL_ASSERT FAILED
|
5 months ago |
AlpinDale
|
ae04f57ec1
feat: Pipeline Parallel support (#581)
|
5 months ago |
AlpinDale
|
405bb74612
Control plane comms refactor (#573)
|
5 months ago |
AlpinDale
|
323fe23b21
chore: use 127.0.0.1 for single-node setups
|
5 months ago |
AlpinDale
|
dfa59bc5f9
fix: 16 GPUs in a cluster
|
5 months ago |
AlpinDale
|
17eb1b7eb9
chore: remove ray health check
|
5 months ago |
AlpinDale
|
de62ceb18c
refactor: eliminate parallel worker per-step task scheduling overhead
|
6 months ago |
AlpinDale
|
9f3d6205ce
fix ray gpu executor
|
6 months ago |
AlpinDale
|
236be273e5
feat: tensor parallel speculative decoding (#554)
|
6 months ago |
AlpinDale
|
c6a501f682
add multiprocessing executor; make ray optional
|
6 months ago |
AlpinDale
|
ef733aee43
implement ExecuteModelData to reduce executor complexity
|
6 months ago |
AlpinDale
|
7bcf4c3fc9
centralize gpu worker construction
|
6 months ago |
AlpinDale
|
fb982981ce
num_lookahead_slots in neuron and ray executors
|
6 months ago |
AlpinDale
|
957ed7d244
type hints
|
6 months ago |
AlpinDale
|
c21af7acad
feat: `DistributedGPUExecutor` abstract class (#541)
|
6 months ago |
AlpinDale
|
199e776722
chore: move ray utils to executor dir
|
6 months ago |
AlpinDale
|
46159b107a
formatting: pt1
|
6 months ago |
AlpinDale
|
fca911ee0a
vLLM Upstream Sync (#526)
|
6 months ago |
AlpinDale
|
f894f7b176
Revert "reduce dedupe by wrapping in general worker class"
|
8 months ago |
AlpinDale
|
082b0b03bc
Revert "actually run the workers"
|
8 months ago |
AlpinDale
|
36cf32649d
actually run the workers
|
8 months ago |
AlpinDale
|
9fff6fb507
reduce dedupe by wrapping in general worker class
|
8 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |