AlpinDale c975bba905 fix: sharded state loader with lora 7 months ago
__init__.py 2bd6c92f73 fix: lora inclusion in wheels 1 year ago
fully_sharded_layers.py e87c32bed3 feat: full tensor parallel for LoRA layers (#545) 7 months ago
layers.py c975bba905 fix: sharded state loader with lora 7 months ago
lora.py e87c32bed3 feat: full tensor parallel for LoRA layers (#545) 7 months ago
models.py 9e73559eba make use of batched rotary embedding kernels to support long context lora 7 months ago
punica.py e87c32bed3 feat: full tensor parallel for LoRA layers (#545) 7 months ago
request.py 5b0c11d190 support pipeline parallel pynccl groups 7 months ago
utils.py c975bba905 fix: sharded state loader with lora 7 months ago
worker_manager.py 5fecc6b025 when was this deprecated? 7 months ago