AlpinDale 5b0c11d190 support pipeline parallel pynccl groups 5 ヶ月 前
..
__init__.py 2bd6c92f73 fix: lora inclusion in wheels 11 ヶ月 前
fully_sharded_layers.py e87c32bed3 feat: full tensor parallel for LoRA layers (#545) 5 ヶ月 前
layers.py 9e73559eba make use of batched rotary embedding kernels to support long context lora 5 ヶ月 前
lora.py e87c32bed3 feat: full tensor parallel for LoRA layers (#545) 5 ヶ月 前
models.py 9e73559eba make use of batched rotary embedding kernels to support long context lora 5 ヶ月 前
punica.py e87c32bed3 feat: full tensor parallel for LoRA layers (#545) 5 ヶ月 前
request.py 5b0c11d190 support pipeline parallel pynccl groups 5 ヶ月 前
utils.py 9e73559eba make use of batched rotary embedding kernels to support long context lora 5 ヶ月 前
worker_manager.py 9e73559eba make use of batched rotary embedding kernels to support long context lora 5 ヶ月 前