AlpinDale
|
52b1514650
api: add health check route
|
1 year ago |
AlpinDale
|
8f93acaf41
update readme
|
1 year ago |
AlpinDale
|
887e03669a
feat: add exllamav2 for GPTQ (#99)
|
1 year ago |
AlpinDale
|
161a69d2e0
Update requirements.txt
|
1 year ago |
AlpinDale
|
8834ecf9de
chore: clean up refactor endpoints (#98)
|
1 year ago |
AlpinDale
|
e113f3e9bf
feat: spaces between special tokens (#94)
|
1 year ago |
AlpinDale
|
ae7d8df224
fix lint issues (again)
|
1 year ago |
50h100a
|
fa0ae5a2c9
feat: new mirostatv2 implementation (#96)
|
1 year ago |
AlpinDale
|
6bfe75c6e9
fix: GPTQ
|
1 year ago |
AlpinDale
|
69204736de
Revert "fix: sync CPU delay in sampler (#93)"
|
1 year ago |
AlpinDale
|
1c988a48b2
fix logging and add py.typed
|
1 year ago |
AlpinDale
|
ce66e1df56
fix: sync CPU delay in sampler (#93)
|
1 year ago |
AlpinDale
|
11c01cfd0a
fix: skip special tokens
|
1 year ago |
AlpinDale
|
41341b220a
requirements: pin pydantic version
|
1 year ago |
g4rg
|
177fad1575
chore: remove unnecessary line (#81)
|
1 year ago |
AlpinDale
|
74604eb252
fix: pylint complaints (#91)
|
1 year ago |
AlpinDale
|
efc6f7fbec
chore: reformats (#90)
|
1 year ago |
AlpinDale
|
3d72f05c7b
feat: flattened 1D tensor -> 2D tensor (#85)
|
1 year ago |
AlpinDale
|
0dcc924088
readme: add benchmarks
|
1 year ago |
AlpinDale
|
eafcd75a69
fix tests and tweak throughput benchmark
|
1 year ago |
AlpinDale
|
2460d17e54
fix positional encoding tests
|
1 year ago |
AlpinDale
|
b047197513
tests: increase tolerance for layernorm
|
1 year ago |
AlpinDale
|
9b317aa26a
feat: finish up tests and workflows (#87)
|
1 year ago |
AlpinDale
|
f73f2dd3c2
feat: add mistral support for GPTQ (#86)
|
1 year ago |
AlpinDale
|
e6be0118c9
feat: prompt logprobs and batched samplers (#77)
|
1 year ago |
city-unit
|
12e296b556
fix: update Dockerfile (#82)
|
1 year ago |
city-unit
|
27af10fd0b
chore: make entrypoint executable (#83)
|
1 year ago |
AlpinDale
|
977e8d3507
update readme with new sampling params
|
1 year ago |
AlpinDale
|
ab1ac578ba
bump version to 0.3.7-dev
|
1 year ago |
Stefan Gligorijevic
|
34c1c8c83a
feat: Enable banning tokens (#80)
|
1 year ago |