Commit History

Author SHA1 Message Date
  AlpinDale f91991f584 fix: f-string fixes 5 months ago
  AlpinDale dba22e4f83 fix: add zeromq fallback for broadcasting large objects (e.g. vlm images) 5 months ago
  AlpinDale 7d79c0e726 chore: use nvml query to avoid accidental cuda initialization 5 months ago
  AlpinDale a89c9a0e92 fix: device ordinal issues with world_size and stuff 5 months ago
  AlpinDale 34b41e0a87 chore: add coordinator to reduce code duplication in tp and pp 5 months ago
  AlpinDale 270bd333af chore: check if process is on the same node 5 months ago
  AlpinDale b2fd915c35 improve p2p access check 5 months ago
  AlpinDale b984fe4a91 refactor custom allreduce to support multiple tp groups 6 months ago
  AlpinDale 47a5c5c00c don't check the full nvlink connectivity 6 months ago
  AlpinDale 096d9eb6c5 enhance nvlink detection 8 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago