ops | 98f9dbd734 | feat: Triton Kernels for Punica (#613) | 4 months ago
__init__.py | 2bd6c92f73 | fix: lora inclusion in wheels | 11 months ago
fully_sharded_layers.py | 98f9dbd734 | feat: Triton Kernels for Punica (#613) | 4 months ago
layers.py | 4d4e767838 | ci: take one of fixing lint issues | 4 months ago
lora.py | 56e0b8223c | chore: add base class for LoRA-supported models | 5 months ago
models.py | 2a349ca3e1 | fix: specify device when loading lora and embedding tensors | 4 months ago
punica.py | 98f9dbd734 | feat: Triton Kernels for Punica (#613) | 4 months ago
request.py | f92b9fc820 | feat: support loading lora adapters directly from HF | 4 months ago
utils.py | 98f9dbd734 | feat: Triton Kernels for Punica (#613) | 4 months ago
worker_manager.py | f92b9fc820 | feat: support loading lora adapters directly from HF | 4 months ago