.. |
ops
|
98f9dbd734
feat: Triton Kernels for Punica (#613)
|
4 月之前 |
__init__.py
|
2bd6c92f73
fix: lora inclusion in wheels
|
11 月之前 |
fully_sharded_layers.py
|
98f9dbd734
feat: Triton Kernels for Punica (#613)
|
4 月之前 |
layers.py
|
4d4e767838
ci: take one of fixing lint issues
|
4 月之前 |
lora.py
|
56e0b8223c
chore: add base class for LoRA-supported models
|
5 月之前 |
models.py
|
2a349ca3e1
fix: specify device when loading lora and embedding tensors
|
4 月之前 |
punica.py
|
98f9dbd734
feat: Triton Kernels for Punica (#613)
|
4 月之前 |
request.py
|
f92b9fc820
feat: support loading lora adapters directly from HF
|
4 月之前 |
utils.py
|
98f9dbd734
feat: Triton Kernels for Punica (#613)
|
4 月之前 |
worker_manager.py
|
f92b9fc820
feat: support loading lora adapters directly from HF
|
4 月之前 |