AlpinDale cda0e93a10 abstract away the platform for device capability 7 ヶ月 前
..
__init__.py 2bd6c92f73 fix: lora inclusion in wheels 1 年間 前
fully_sharded_layers.py 8a6e83b52e feat: fully sharded QKVParallelLinearWithLora support 7 ヶ月 前
layers.py 0f4a9ee77b quantized lm_head (#582) 7 ヶ月 前
lora.py 56e0b8223c chore: add base class for LoRA-supported models 7 ヶ月 前
models.py 0a6db357d8 fix: use safetensor keys instead of adapter_config.json to find unexpected modules 7 ヶ月 前
punica.py cda0e93a10 abstract away the platform for device capability 7 ヶ月 前
request.py 5b0c11d190 support pipeline parallel pynccl groups 7 ヶ月 前
utils.py 8a6e83b52e feat: fully sharded QKVParallelLinearWithLora support 7 ヶ月 前
worker_manager.py 25feb1d592 chore: add support for pinning lora adapters in the lru cache 7 ヶ月 前