AlpinDale
|
68f050129d
fix: lora worker manager test import
|
1 month ago |
AlpinDale
|
3661de812d
fix: lora layer test
|
1 month ago |
AlpinDale
|
0a369f9171
feat: support chunked prefill with LoRA (#823)
|
1 month ago |
AlpinDale
|
e5b1afe625
feat: add chat method for LLM class (#822)
|
1 month ago |
AlpinDale
|
262cbc63b7
fix: vision api test template path
|
1 month ago |
AlpinDale
|
b0113a1eaa
fix: tokenization api test (#821)
|
1 month ago |
AlpinDale
|
c6c91edab7
ci: update & overhaul test units (#769)
|
1 month ago |
AlpinDale
|
f088ea81c7
fix: --max-seq-len-to-capture arg (#818)
|
1 month ago |
50h100a
|
a5346b2ea5
Merge pull request #814 from PygmalionAI/50h100a-temp-fix
|
1 month ago |
50h100a
|
273c61d406
guard against nan temperature from dynatemp (or anywhere else).
|
1 month ago |
50h100a
|
a22e887319
why we don't use the github website editor to make changes
|
1 month ago |
50h100a
|
54a8320638
logit shenanigans to prevent even worse shenanigans
|
1 month ago |
50h100a
|
b6a897d2a1
fix temperature, and address those pernicious dynatemp NaNs
|
1 month ago |
50h100a
|
a61d00fad7
Merge pull request #813 from PygmalionAI/50h100a-patch-1
|
1 month ago |
50h100a
|
83040c6389
Mask dynatemp using min/max, rather than exp
|
1 month ago |
AlpinDale
|
2fa112f86b
feat: update to serviceinfo v0.2 (#808)
|
2 months ago |
AlpinDale
|
72fbfa1b5b
feat: add serviceinfo endpoint (#807)
|
2 months ago |
AlpinDale
|
6e25b03f25
ci: docker build and upload script
|
2 months ago |
AlpinDale
|
6145deab4a
frontend: enable kobold api by default (#803)
|
2 months ago |
AlpinDale
|
f0e00f1b43
ci: bump to 0.6.3.post1 (#801)
|
2 months ago |
AlpinDale
|
5b0eabe0e8
fix: compilation of gptq_marlin_gemm object (#800)
|
2 months ago |
dependabot[bot]
|
7e2d2e7ae7
build(deps): bump rollup from 4.21.0 to 4.24.3 in /docs (#796)
|
2 months ago |
AlpinDale
|
76c05c5591
ci: bump version to 0.6.3 (#799)
|
2 months ago |
AlpinDale
|
0f1af04cf5
frontend: minor logging improvements (#787)
|
2 months ago |
AlpinDale
|
f98e7b2f8c
feat: add HQQ quantization support (#795)
|
2 months ago |
AlpinDale
|
43965f7bd9
fix: kobold lite embedded UI on windows (#797)
|
2 months ago |
AlpinDale
|
2d97f4014e
fix: windows wheel url (#794)
|
2 months ago |
AlpinDale
|
0256ed236b
feat: windows support (#790)
|
2 months ago |
AlpinDale
|
dcb794a340
fix: revert incorrect commit
|
2 months ago |
AlpinDale
|
76367b5ae7
wip
|
2 months ago |