AlpinDale | 9d9722b1c1 | fix: metrics endpoint with RPC server (#747) | 3 months ago
AlpinDale | 2d044af0e1 | chore: spawn engine process from api server process (#703) | 4 months ago
AlpinDale | 1d3a1fec47 | feat: add load/unload endpoints for soft-prompts (#694) | 4 months ago
AlpinDale | c34a6ac8e4 | feat: add lora loading/unloading api endpoint (#693) | 4 months ago
AlpinDale | ed9a6f97c1 | fix: kill api server when pinging dead engine (#660) | 4 months ago
AlpinDale | 83bcb9119a | fix: multiprocessing timeout (#654) | 4 months ago
AlpinDale | a2344d3617 | fix: move zeromq rpc frontend to IPC instead of TCP (#652) | 4 months ago
AlpinDale | 59264d32e9 | fix: hardcoded float16 in embedding mode check (#645) | 4 months ago
AlpinDale | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
AlpinDale | 9d81716bfd | [v0.5.3] Release Candidate (#388) | 8 months ago
Krovius | 205c8e4106 | fix: kobold api /tokencount (#424) | 8 months ago
IggoOnCode | 2aec297c55 | feat: add embeddings endpoint to openai rest-api server. (#363) | 9 months ago
AlpinDale | f8dfac6372 | chore: attention refactor and upstream sync apr01 (#365) | 9 months ago
Absurd | 070c1cef8c | fix: explicit RFC3986 for prometheus_client asgi (#344) | 9 months ago
AlpinDale | 398a97338a | feat: enable lora loading/unloading via API (#337) | 9 months ago
AlpinDale | e42a78381a | feat: switch from pylint to ruff (#322) | 10 months ago
drummerv | e59dd4a90d | fix: openai gguf chat template (#312) | 10 months ago
AlpinDale | c2d77b1822 | chore: logging refactor (#302) | 10 months ago
Pyroserenus | 951077de65 | chore: update klite.embd with current version (#296) | 10 months ago
AlpinDale | ac82b67f75 | feat: naive context shift and various QoL changes (#289) | 10 months ago
AlpinDale | f35d15e632 | fix: arg detection for kobold api launch (#286) | 10 months ago
AlpinDale | 23a7fd8cda | remove ooba endpoint, fix and add deprecation warning for kobold endpoint, fix case where kobold endpoint was always launched with openai (#284) | 10 months ago
AlpinDale | 9fa99215f8 | feat: add cubic sampling (#280) | 10 months ago
AlpinDale | 657aec0cbd | refactor: OpenAI endpoint (#261) | 10 months ago
AlpinDale | 4d04ade9ef | feat: fine-grained seeds (#279) | 10 months ago
swadical | 0527131e93 | fix: grammar logits processor (#268) | 10 months ago
AlpinDale | d2db4143fa | feat: add grafana for metrics (#240) | 11 months ago
AlpinDale | 1c46fa31ad | feat: add quadratic sampling (#233) | 11 months ago
AlpinDale | 0adab894fe | feat: grammar support (#206) | 11 months ago
AlpinDale | 8fa608aeb7 | feat: replace Ray with NCCL for control plane comms (#221) | 11 months ago