AlpinDale
|
4b80b42362
fix: memory leaks due to nccl cuda graphs (#275)
|
il y a 11 mois |
AlpinDale
|
641bb0f6e9
feat: add custom allreduce kernels (#224)
|
il y a 1 an |
AlpinDale
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
il y a 1 an |
AlpinDale
|
74604eb252
fix: pylint complaints (#91)
|
il y a 1 an |
AlpinDale
|
a6a4220fa6
feat: refactor megatron and quants (#57)
|
il y a 1 an |