AlpinDale cda0e93a10 abstract away the platform for device capability 7 months ago
__init__.py 2bd6c92f73 fix: lora inclusion in wheels 1 year ago
fully_sharded_layers.py 8a6e83b52e feat: fully sharded QKVParallelLinearWithLora support 7 months ago
layers.py 0f4a9ee77b quantized lm_head (#582) 7 months ago
lora.py 56e0b8223c chore: add base class for LoRA-supported models 7 months ago
models.py 0a6db357d8 fix: use safetensor keys instead of adapter_config.json to find unexpected modules 7 months ago
punica.py cda0e93a10 abstract away the platform for device capability 7 months ago
request.py 5b0c11d190 support pipeline parallel pynccl groups 7 months ago
utils.py 8a6e83b52e feat: fully sharded QKVParallelLinearWithLora support 7 months ago
worker_manager.py 25feb1d592 chore: add support for pinning lora adapters in the lru cache 7 months ago