AlpinDale
|
4d04ade9ef
feat: fine-grained seeds (#279)
|
11 months ago |
Stefan Daniel Schwarz
|
810ca83066
fix+feat: docker compose (#264)
|
11 months ago |
AlpinDale
|
16615784b3
fix: prefix cache for turing gpus
|
11 months ago |
AlpinDale
|
7dc73a779a
fix: properly perform garbage collection for lora (#277)
|
11 months ago |
AlpinDale
|
697c06c4f5
fix: LoRA support for mixtral (#276)
|
11 months ago |
AlpinDale
|
4b80b42362
fix: memory leaks due to nccl cuda graphs (#275)
|
11 months ago |
AlpinDale
|
e31c6f0b45
feat: refactor modeling logic and support more models (#274)
|
11 months ago |
AlpinDale
|
7d6ba53602
feat: fused top-k kernels for MoE (#273)
|
11 months ago |
AlpinDale
|
a3cab09b69
chore: logging env variable
|
11 months ago |
AlpinDale
|
2c08aa5af4
chore: remove eos token from output (#272)
|
11 months ago |
AlpinDale
|
8e1cd54497
fix: do not include fp8 for rocm (#271)
|
11 months ago |
AlpinDale
|
6a63ab4ec3
fix: remote encode request if using ray (#270)
|
11 months ago |
AlpinDale
|
224b87b484
feat: add fused mixtral moe support (#238)
|
11 months ago |
Thomas Xin
|
43cf0e98a0
fix: worker initialization on WSL (#260)
|
11 months ago |
swadical
|
0527131e93
fix: grammar logits processor (#268)
|
11 months ago |
AlpinDale
|
2370dbcfd8
feat: OPT model support (#266)
|
11 months ago |
AlpinDale
|
4360684667
fix: cuda version in wheel
|
11 months ago |
TearGosling
|
80e8a14949
feat: add pygchat Jinja template (#218)
|
11 months ago |
sgsdxzy
|
fe7844f2ef
feat: sharding and safetensors support for gguf conversion (#256)
|
11 months ago |
AlpinDale
|
8635901c76
fix: s-lora vocab embeddings
|
11 months ago |
AlpinDale
|
c76b611021
docker: update the Dockerfile and push the latest image (#254)
|
11 months ago |
anon998
|
35b9033782
fix: crash in quadratic sampling when batch > 1 (#253)
|
11 months ago |
AlpinDale
|
842912d022
feat: on-the-fly gguf conversion (#250)
|
11 months ago |
AlpinDale
|
faca8745d6
fix: linting issue (#249)
|
11 months ago |
AlpinDale
|
3163839c88
bump version to 0.4.9
|
11 months ago |
AlpinDale
|
f99eb2c874
fix: hadamard tensors not included in wheel
|
11 months ago |
AlpinDale
|
8b6790d504
fix: gguf config not recognized
|
11 months ago |
AlpinDale
|
a1836a40e2
bump version to v0.4.8
|
11 months ago |
AlpinDale
|
2bd6c92f73
fix: lora inclusion in wheels
|
11 months ago |
AlpinDale
|
8da2be03ce
feat: bump version to v0.4.7 (#248)
|
11 months ago |