Commit History

Autor SHA1 Mensaxe Data
  AlpinDale 4d4e767838 ci: take one of fixing lint issues hai 5 meses
  AlpinDale cd31f8efbb chore: optimize PP comm by replacing send with partial send + allgather hai 5 meses
  AlpinDale 705e50f4bd fix: broadcasting logic for multi_modal_kwargs hai 5 meses
  AlpinDale d907f20908 feat: support collective comms in XLA devices, e.g. TPUs hai 5 meses
  AlpinDale 42c66d5b00 feat: tensor parallelism for CPU backend hai 5 meses
  AlpinDale 8ade64c0cc fix: prevent possible data race by adding sync hai 5 meses
  AlpinDale f91991f584 fix: f-string fixes hai 5 meses
  AlpinDale 5289c14b24 feat: Asymmetric Tensor Parallel (#594) hai 5 meses
  AlpinDale dba22e4f83 fix: add zeromq fallback for broadcasting large objects (e.g. vlm images) hai 5 meses
  AlpinDale bdf1cc1aec fix: allow using custom all reduce when pp_size > 1 hai 5 meses
  AlpinDale ae04f57ec1 feat: Pipeline Parallel support (#581) hai 6 meses
  AlpinDale 4cdc810b1c fix: minor TP issues with vision models hai 6 meses
  AlpinDale 9868bb2290 chore: make it clear that '%' should NOT be in tensor dict keys hai 6 meses
  AlpinDale bb4da84623 fix: make sure multi modal kwargs can broadcast properly with ring buffer hai 6 meses
  AlpinDale bc5ac9584a fix: make tensor_dict flattening/unflattening more generic hai 6 meses
  AlpinDale abbb730607 feat: support draft model on different tensor parallel size hai 6 meses
  AlpinDale e238abf0cc chore: send and recv helper functions hai 6 meses
  AlpinDale 1b340083b1 feat: add shm broadcast hai 6 meses
  AlpinDale 6a57861fca feat: initial XPU support via intel_extension_for_pytorch (#571) hai 6 meses
  AlpinDale cc3486477e fix: benign multiprocessing error hai 6 meses
  AlpinDale 1d00b61622 feat: w4a16 support for compressed-tensors hai 6 meses
  AlpinDale 34b41e0a87 chore: add coordinator to reduce code duplication in tp and pp hai 6 meses
  AlpinDale 270bd333af chore: check if process is on the same node hai 6 meses
  AlpinDale 5b0c11d190 support pipeline parallel pynccl groups hai 6 meses
  AlpinDale b984fe4a91 refactor custom allreduce to support multiple tp groups hai 6 meses
  AlpinDale 8ae2cce237 refactor pynccl hai 6 meses
  AlpinDale 1879e32510 enable all-reduce for multiple tp groups hai 6 meses
  AlpinDale 4c746d8baa chore: init nccl using the gloo backend hai 7 meses
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) hai 9 meses