AlpinDale
|
9e73559eba
make use of batched rotary embedding kernels to support long context lora
|
8 months ago |
AlpinDale
|
eaa06fdd14
fix some f-strings
|
8 months ago |
AlpinDale
|
b55381df0e
speed up lora loading times by reusing the cpu dummy lora
|
8 months ago |
AlpinDale
|
e87c32bed3
feat: full tensor parallel for LoRA layers (#545)
|
8 months ago |
AlpinDale
|
8be299e78b
fix: lora load check
|
10 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
10 months ago |
AlpinDale
|
e3252edd07
fix: remove event and stream, add typing (#382)
|
11 months ago |
AlpinDale
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
1 year ago |
AlpinDale
|
657aec0cbd
refactor: OpenAI endpoint (#261)
|
1 year ago |
AlpinDale
|
697c06c4f5
fix: LoRA support for mixtral (#276)
|
1 year ago |
AlpinDale
|
c0aac15421
feat: S-LoRA support (#222)
|
1 year ago |