AlpinDale
|
a89c9a0e92
fix: device ordinal issues with world_size and stuff
|
7 months ago |
AlpinDale
|
427ab15434
fix: check_health when world_size==1
|
7 months ago |
AlpinDale
|
05d6e43244
fix: `torch.compile()` with mp executor backend
|
7 months ago |
AlpinDale
|
5b0c11d190
support pipeline parallel pynccl groups
|
7 months ago |
AlpinDale
|
de62ceb18c
refactor: eliminate parallel worker per-step task scheduling overhead
|
7 months ago |
AlpinDale
|
236be273e5
feat: tensor parallel speculative decoding (#554)
|
7 months ago |
AlpinDale
|
c6a501f682
add multiprocessing executor; make ray optional
|
7 months ago |