AlpinDale
|
8a6e83b52e
feat: fully sharded QKVParallelLinearWithLora support
|
5 months ago |
AlpinDale
|
42d2ee0f43
chore: better error logging for unsupported lora weights
|
5 months ago |
AlpinDale
|
c975bba905
fix: sharded state loader with lora
|
6 months ago |
AlpinDale
|
9e73559eba
make use of batched rotary embedding kernels to support long context lora
|
6 months ago |
AlpinDale
|
e87c32bed3
feat: full tensor parallel for LoRA layers (#545)
|
6 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
AlpinDale
|
657aec0cbd
refactor: OpenAI endpoint (#261)
|
11 months ago |
AlpinDale
|
c0aac15421
feat: S-LoRA support (#222)
|
1 year ago |