Commit History

Author SHA1 Message Date
  AlpinDale ae04f57ec1 feat: Pipeline Parallel support (#581) 7 months ago
  AlpinDale 405bb74612 Control plane comms refactor (#573) 7 months ago
  AlpinDale 2c321ce1f2 chore: upgrade to rocm 6.1, update docker 7 months ago
  AlpinDale 323fe23b21 chore: use 127.0.0.1 for single-node setups 7 months ago
  AlpinDale a89c9a0e92 fix: device ordinal issues with world_size and stuff 7 months ago
  AlpinDale 427ab15434 fix: check_health when world_size==1 7 months ago
  AlpinDale 05d6e43244 fix: `torch.compile()` with mp executor backend 7 months ago
  AlpinDale 5b0c11d190 support pipeline parallel pynccl groups 7 months ago
  AlpinDale de62ceb18c refactor: eliminate parallel worker per-step task scheduling overhead 7 months ago
  AlpinDale 236be273e5 feat: tensor parallel speculative decoding (#554) 7 months ago
  AlpinDale c6a501f682 add multiprocessing executor; make ray optional 7 months ago