AlpinDale cda0e93a10 abstract away the platform for device capability hai 7 meses
..
__init__.py 2bd6c92f73 fix: lora inclusion in wheels hai 1 ano
fully_sharded_layers.py 8a6e83b52e feat: fully sharded QKVParallelLinearWithLora support hai 7 meses
layers.py 0f4a9ee77b quantized lm_head (#582) hai 7 meses
lora.py 56e0b8223c chore: add base class for LoRA-supported models hai 7 meses
models.py 0a6db357d8 fix: use safetensor keys instead of adapter_config.json to find unexpected modules hai 7 meses
punica.py cda0e93a10 abstract away the platform for device capability hai 7 meses
request.py 5b0c11d190 support pipeline parallel pynccl groups hai 7 meses
utils.py 8a6e83b52e feat: fully sharded QKVParallelLinearWithLora support hai 7 meses
worker_manager.py 25feb1d592 chore: add support for pinning lora adapters in the lru cache hai 7 meses